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Pulsing  media  expenditure  schedules,  i.e.  a few 
concentrated  bursts  of  advertising  interspersed  with  periods 
of  no  advertising,  are  widely  used  in  practice.  However, 
contrary  to  the  managerial  practice,  academic  research  has 
shown  that  a uniform  advertising  spending  pattern  over  time 
is  the  optimal  scheduling  strategy.  This  begs  the  question: 
why  do  advertisers  use  pulsing  schedules  in  practice? 

This  dissertation  argues  that  previous  models  have 
ignored  a key  empirical  feature:  advertising  wearout,  i.e.  a 
decline  in  the  quality  (effectiveness)  of  advertisements. 

The  assumption  of  constant  quality  of  advertising  in 
previous  studies  has  resulted  in  smoothing  out  the  media 
expenditure  patterns  over  time;  hence,  a uniform  spending 
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pattern  has  emerged  as  the  optimal  schedule  in  the  previous 
studies . 

In  this  dissertation  I incorporate  the  role  of 
advertising  wearout  phenomenon.  I show  that  the  quality  of 
an  advertisement  evolves  over  time.  In  particular,  ad 
quality  declines  when  advertising  is  on,  and  it  restores 
during  media  hiatus.  Hence,  it  behooves  an  advertiser  to 
stop  advertising  after  a while  when  ad  quality  gets  quite 
low,  and  wait  until  it  restores  to  some  meaningful  level 
before  starting  the  next  advertising  burst.  Using  the 
proposed  model,  the  advertisers  can  determine  the  optimal 
duration  of  advertising  bursts,  the  spending  intensity,  as 
well  as  the  spacing  between  bursts  to  achieve  maximum 
revenue . 

The  model  was  tested  on  real  advertising  data  from 
awareness  tracking  studies  and  was  found  to  be  superior, 
theoretically  and  statistically,  to  several  benchmark  models 
from  the  literature.  I have  illustrated  how  to  determine 
empirically  the  rate  at  which  advertisements  wear  out,  the 
magnitudes  of  forgetting  and  carry-over  effects,  and  other 
parameters  of  the  dynamic  model.  A decision  support  system 
is  designed  to  find  the  best  pulsing  schedule. 
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CHAPTER  1 

ADVERTISING  WEAROUT  AND  PULSING  MEDIA  SCHEDULES 

Introduction 

An  important  empirical  generalization  in  marketing  is 
that  increased  advertising  spending  does  not  necessarily 
increase  sales  (Lodish  et  al.  1995).  Research  pointing 
toward  this  conclusion  has  been  appearing  for  the  last 
thirty  years  (e.g.  Buzzell  1964,  Eastlack  and  Rao  1984, 
Abraham  and  Lodish  1990,  Adams  and  Blair  1992,  Blair  and 
Rosenberg  1994) . This  robust  finding  can  be  attributed  at 
least  partially  to  the  phenomenon  of  advertising  wearout 
i.e.,  a decline  in  the  quality  of  an  advertisement  (Grass 
and  Wallace  1969,  Ray  and  Sawyer  1971,  Greenberg  and  Suttoni 
1973,  Calder  and  Sternthal  1980,  Kamins  1981,  Pechmann  and 
Stewart  1990) . However,  despite  this  phenomenon  being  well- 
recognized  in  advertising  practice,  there  are  only  a few 
models  of  optimal  advertising  strategy  (e.g.  Pekelman  and 
Sethi  1978)  which  explicitly  incorporate  advertising  wearout 
in  their  formulation. 

The  theoretical  link  between  advertising  wearout  and 
media  scheduling  was  recognized  by  Ray  (1975,  p.51)  in  his 
review  of  potential  linkages  and  problems  of  advertising. 

But  there  is  very  little  theoretical  research  that  provides 
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directions  on  planning  optimal  advertising  spending  patterns 
in  the  presence  of  advertising  wearout  (Percy  and  Rossiter 
1980,  Ch.  6) . Such  research  is  needed  to  resolve  a host  of 
important  managerial  questions  for  which  there  are  few  good 
answers  in  the  literature.  For  example,  Corkindale  and 
Kennedy  (1975,  p.75)  have  noted: 

Little  established  knowledge  appears  to  exist  to 

formulate  specific  principles  on: 

• frequency  effects  — how  should  advertisements  be 
scheduled  to  appear? 

• the  existence  of  precise  thresholds  of  expenditure 
— is  there  a limit  below  which  it  is  not  worth 
spending? 

• the  rate  at  which  advertisements  'wear  out'  — how 
many  different  advertisements  should  be  made  for  a 
campaign? 

• the  relative  merits  of  'continuous'  versus 
'concentrated  bursts'  of  advertising  — do  we 
periodically  make  a big  'impact'  or  have  a 
'continuous  presence'? 

Some  of  these  issues  raised  by  Corkindale  and  Kennedy 
(1975)  have  been  investigated  in  the  last  two  decades.  In 
particular,  the  question  of  when  pulsing  media  schedules  can 
be  superior  to  continuous  spending  has  received  a fair 
degree  of  attention  (e.g.,  Simon  1982,  Luhmer  et  al.  1988, 
Mesak  1985,  1992,  Mahajan  and  Muller  1986,  Park  and  Hahn 
1991,  Hahn  and  Hyun  1991,  Feinberg  1992,  Villas  Boas  1993) . 
However,  the  associated  questions  of  what  is  the  optimal 
spacing  and  timing  for  pulsing  media  schedules  are  still 
largely  unresolved  (e.g.,  Strong  1977,  Zielske  and  Henry 
1980,  Simon  1982,  Mesak  1992) . 
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In  this  dissertation  I focus  on  the  following 
questions : 

1.  How  can  advertisers  determine  the  rate  at  which 
advertisements  wear  out? 

2.  Is  there  an  optimal  number  of  bursts  that  should  be 
planned  in  a given  year? 

3.  How  long  should  each  advertising  burst  last? 

4.  What  should  be  the  spacing  between  bursts? 

5.  Should  the  length  of  advertising  burst  and  the 
spacing  between  bursts  be  equally  long? 

The  relevance  and  importance  of  these  questions  to 
managers  were  established  after  several  conversations  with 
leading  advertising  agencies,  advertisers,  and  marketing 
consulting  firms.  A summary  of  insights  and  impressions  from 
these  conversations  is  noted  in  Appendix  A.  The  extant  trade 
literature  supports  the  conclusion  that  these  are  pressing 
concerns  for  advertisers  (see  the  Advertising  Manager's 
Handbook,  Bly  1993,  p.251-254) . A recent  survey  among  media 
directors  of  leading  U.S.  advertising  agencies  also 
highlights  the  need  to  resolve  the  issues  raised  above.  For 
example,  a survey  indicates  that  a majority  («  96%)  of  the 
respondents  would  welcome  improvement  in  their  agency' s 
media  planning  procedures  to  determine  the  impact  of 
different  schedules  and  different  levels  of  exposures 
(Kreshel,  Lancaster,  Toomey  1985,  p.38). 

The  aim  of  this  dissertation  is  to  answer  the  above 
research  questions  and  provide  actionable  model-based 
guidelines  to  advertisers  and  agencies.  Specifically,  I 
consider  the  problem  of  an  advertiser  who  has  a fixed  dollar 
budget  for  a brand  to  be  used  over  the  given  planning 
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horizon  in  order  to  build  the  brand's  top-of-mind  awareness 
and  must  decide  how  to  allocate  this  budget  over  time. 

Should  an  advertising  spending  be  concentrated  over  a short 
period  of  the  planning  horizon  (a  blitz  schedule)  or  spread 
over  uniformly  over  the  entire  planning  horizon  (an  even 
schedule)?  For  a blitz  schedule,  what  should  be  the  level  or 
intensity  of  spending  and  how  long  should  this  advertising 
burst  last?  More  generally,  for  any  multiple  bursts 
schedule,  what  should  be  the  intensity  of  ad  spending  in 
each  burst?  How  long  should  each  burst  last?  When  should  the 
next  burst  start? 

A related  issue  is  whether  the  quality  of  the 
advertisement  copy  is  affected  by  the  pattern  of  media 
spending?  For  example,  when  exposures  become  increasingly 
massed  (e.g.  in  a blitz  schedule),  advertisements  are  more 
likely  to  wear  out  (Pechman  and  Stewart  1990),  whereas  when 
ads  are  not  telecast  for  a while  and  reintroduced  later,  the 
ad  quality  can  restore  (Grass  and  Wallace  1969,  Greenberg 
and  Suttoni  1973)  . Then,  how  does  one  determine  the  rate  at 
which  the  advertisement  wears  out?  Due  to  the  dynamic  inter- 
relatedness of  the  spending  patterns  and  advertisement 
quality,  the  issues  of  advertising  wearout  and  pulsing 
patterns  of  media  schedules  need  to  be  studied 
simultaneously.  Only  then  an  advertiser  can  meaningfully 
determine  the  best  plan  of  allocating  advertising  budget 
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The  Organization  Of  Dissertation 

The  dissertation  is  organized  as  follows.  In  chapter  2, 
I review  the  extant  literature  on  media  scheduling  and  ad 
wearout.  This  review  shows  that  dynamic  response  models  of 
advertising  have  neglected  the  role  of  advertising  wearout. 
Consequently,  these  models  prescribe  that  monopolists  facing 
diminishing  returns  to  advertising  efforts  should  spend  the 
advertising  budget  uniformly  over  the  entire  planning 
horizon.  However,  the  prevalent  practice  is  to  schedule 
advertising  in  a few  bursts  interspersed  with  periods  of  no 
advertising.  In  other  words,  theory  prescribes  that  an  even 
schedule  is  the  best  policy,  but  practice  prefers  pulsing 
schedules . 

In  chapter  3,  I develop  a dynamic  advertising  response 
model  which  explicitly  incorporates  the  role  of  advertising 
wearout  and  the  evolution  of  advertisement  quality  during 
the  presence  and  absence  of  advertising.  Using  real  data  for 
a brand  of  cereal,  I support  the  model  formulation  and  test 
its  superiority  relative  to  other  models  in  the  literature. 

In  Chapter  4,  theoretical  issues,  such  as  whether  media 
resources  should  be  concentrated  or  not  or  what  should  be 
the  managerial  response  when  copy  or  repetition  wearout 
enhances,  involved  in  blitz  and  two-burst  schedules  are 
investigated  in  detail.  Then,  in  Chapter  5,  I extend  the 
model  to  the  general  situation  of  multiple  advertising 
bursts  and  present  a decision  support  system  which  can  help 
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advertiser  find  the  best  pulsing  schedule.  In  the  final 
chapter  6 I summarize  the  findings  and  provide  directions 
for  future  research. 

The  meanings  of  some  technical  terms  are  given  in  the 
glossary,  while  notations  are  introduced  in  Appendix  B. 


CHAPTER  2 
LITERATURE  REVIEW 

Overview 

Several  scholars  (e.g.  Rao  1970,  Sethi  1971,  Sasieni 

1971,  Little  1975,  1979,  H.  Simon  1982,  Mahajan  and  Muller 

1986,  Jones  1995)  have  noted  that  marketing  practitioners 

commonly  use  pulsing  schedules,  i.e.  alternating  advertising 

spending  between  high  and  zero  levels.!  This  motivated 

Sasieni ( 1971 ) to  analyze  the  problem  of  optimal  allocation 

of  media  budget  over  time.  He  reasoned  that 

...over  the  years  marketing  men  must  have  discovered 
some  optimal  advertising  strategies  by  empirical 
methods.  Since  many  people  believe  that  there  are 
circumstances  in  which  cyclic  or  pulsing  strategy  can 
be  optimal  it  should  be  possible  to  discover  a class  of 
response  functions,  whose  parameters  can  be  specialized 
to  yield  a cyclic  policy,  (p.  P-71) 

His  analysis  led  him  to  the  following  conclusion:  In  the 

presence  of  diminishing  marginal  returns  to  advertising 

expenditure,  the  optimal  strategy  is  to  reach  an  appropriate 

level  of  sales  (or  awareness)  and  to  maintain  it  with 

continuous  advertising  at  some  fixed  level  (also  see  Sasieni 


1 I use  the  terms  to  describe  media  schedules  consistent 
with  Mahajan  and  Muller  (1986,  p.91)  definitions,  which  is 
reviewed  later  in  this  chapter. 
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1989) . That  is,  cyclic  or  pulsing  policies  cannot  be  any 

better  than  an  even  spending  schedule.2 

Other  scholars  have  also  reached  a similar  conclusion 

(Nerlove  and  Arrow  1962,  Gould  1970,  Sethi  1971,  1973,  1977, 

Horsky  1977,  Rao  1985,  Mahajan  and  Muller  1986) . More 

generally,  using  optimal  control  theory,  Hartl  (1987) 

derives  the  monotonicity  theorem,  under  fairly  general 

conditions,  which  rules  out  any  non-monotonic  schedule  (such 

as  pulsing)  as  a candidate  for  the  optimal  policy. 

This  robust  finding  about  the  superiority  of  the  even 

spending  policy  has  had  two  consequences.  On  the  one  hand, 

the  practitioners  bemoaned  about  the  lack  of  'principles  of 

scheduling'  (e.g.  Corkindale  and  Kennedy  1975,  Bly  1993) 

because  they  have  had  little  normative  guidance  to  plan 

pulsing  schedules  (Strong  1977,  Zielske  and  Henry  1980, 

Kreshel  et  al.  1985,  Lancaster  et  al.  1986)  while,  on  the 

other,  the  academic  marketers  were  intrigued  by  the 

advertisers'  preference  for  pulsing  schedules  (e.g.  Little 

1979,  Naslund  1979,  H.  Simon  1982) . In  reviewing  the  state- 

of-the-art  of  advertising  models.  Little  (1979,  p.630) 

describes  the  situation  succinctly  as  follows: 

One  often  sees  media  scheduled  in  intensive  'flights' 
so  that  'the  message  can  be  heard  through  the  noise, ' 
but,  if  someone  asks  why  not  make  the  flight  half  as 
long  and  twice  as  intense  or  else  twice  as  long  and 
half  as  intense,  no  good  answers  can  be  given. 


2 In  an  even  spending  schedule,  the  firm  advertises  at  a 
constant  rate,  in  dollars  per  week,  throughout  the  planning 
horizon  (see  Mahajan  and  Muller,  1986,  p.  91) . 
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While  it  is  believed  that  "the  question  of  whether  a 
pulsed  advertising  schedule  is  superior  to  a schedule  of 
constant  spending  over  time  is  of  utmost  importance  (H. 

Simon  1982,  p.353),"  it  is  only  in  the  last  decade 
researchers  have  attempted  to  find  conditions  when  pulsing 
can  be  superior  to  even  spending  policy  (e.g.  Mahajan  and 
Muller  1986,  Mesak  1985,  1992,  Luhmer  et  al.  1988,  Hahn  and 
Hyun  1991,  Park  and  Hahn  1991,  Feinberg  1992,  Villas-Boas 
1993) . 

However,  this  stream  of  research  does  not  provide 
insight  into  how  to  design  the  'superior'  pulsing  schedule. 
In  other  words,  even  today,  no  good  answers  can  be  given  for 
the  duration  and  intensity  of  the  pulsing  schedule  that 
should  be  planned  to  expend  media  budget  meaningfully. 

In  the  meanwhile,  the  practice  of  advertising  continues 
to  use  pulsing  schedules  almost  universally  (Jones  1995) , 
and  it  seeks  formal  guidance  on  determining  optimal  pulsing 
schedule  (Malkin  1993) . Interestingly,  an  overwhelming 
majority  (95.7%)  of  media  directors  of  large  advertising 
agencies  in  the  U.S.  believe  that  "research  must  be  done  to 
determine  the  exact  impact  on  sales  of  such  things 
as ...  different  scheduling  techniques  and  different  levels  of 
exposure  (Kreshel,  Lancaster,  and  Toomey  1985,  p.38)."  This 
dissertation  intends  to  provide  such  guidance. 

Extant  dynamic  response  models  of  advertising  do  not 
capture  reality  adequately  because  they  have  missed  an 
essential  empirical  feature:  advertising  wearout,  i.e. 
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decline  in  the  quality  of  advertising  (Grass  and  Wallace 
1969,  Greenberg  and  Suttoni  1973,  Pekelman  and  Sethi  1978, 
Axelrod  1980,  Calder  and  Sternthal  1980,  Blair  1987,  Pechman 
and  Stewart  1990).  Buzzell  (1964,  p.27)  notes  that  "...  the 
marketing  models  based  on  dollar  expenditures  alone  cannot 
provide  sufficient  explanations  of  market  response  to 
advertising"  and  argues  for  the  existence  and  importance  of 
ad  quality.  Based  on  the  empirical  analysis,  Buzzell  (p.31) 
says. 

If  anything,  advertising  message  quality  is  more 
important  than  the  level  of  advertising  expenditure. 
Message  quality  is  neither  insignificant  nor  invariant, 
as  has  often  been  assumed  in  models  of  advertising 
process . 

In  spite  of  this,  the  notion  of  ad  quality  (also  see 
Parsons  and  Schultz  1976,  Arnold  et  al.  1987)  or 
equivalently  copy  effectiveness  (Little  1975)  is  often 
ignored  in  the  modeling  literature.  Indeed,  there  is  no 
study  which  explicitly  specifies  the  dynamics  of  ad  quality 
by  incorporating  the  role  of  advertising  wearout  (Kamins 
1981,  Pechman  and  Stewart  1990),  or  builds  the  theoretical 
link  between  advertising  wearout  and  media  scheduling  as 
recognized  by  Ray  (1975,  p.51).  Hence,  relaxing  the 
assumption  of  constant  ad  quality  is  essential  to 
demonstrate  that  pulsing  can  be  superior  to  an  even  schedule 
as  well  as  to  design  an  optimal  pulsing  schedule. 

A brief  review  of  the  above  literature  is  organized  as 
follows.  First  I state  the  definitions  of  various  media 
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schedules.  Then,  I will  describe  the  results  of  the  models 
which  assert  even  spending  policy  as  being  the  optimal 
spending  strategy.  Next,  I will  review  the  models  from  the 
pulsing  literature  which  identify  conditions  when  this 
superiority  of  even  policy  does  not  hold.  Finally,  I will 
review  the  literature  on  advertising  wearout,  advertising 
quality  and  overlearning  to  the  extent  it  is  of  relevance  to 
my  model  development  in  the  next  chapter. 


Definition  of  Media  Schedules 


The  following  definitions  of  various  media  schedules 
are  based  on  Mahajan  and  Muller  (1986) . Some  examples  are 
illustrated  in  panels  (a)  through  (e)  of  figure  1 to  clarify 
the  meaning. 

1.  Blitz.  A one  pulse  schedule  in  which  the  advertiser 
concentrates  all  its  efforts  in  some  initial 
period.  See  panel  (a) . 

2.  Pulsing.  A schedule  in  which  the  advertiser 
alternates  between  high  and  zero  level  of  spending 
rate.  See  panels  (b)  and  (c) . 

3.  Chattering.  A theoretical  schedule  in  which  the 
advertiser  alternates  between  high  and  zero  level 
of  spending  infinite  number  of  times  during  some 
finite  planning  horizon. 

4.  Even.  A schedule  in  which  the  advertiser  spends  at 
some  constant  level  throughout  the  planning 
horizon.  See  panel  (d) . 

5.  Pulsing  Maintenance.  A schedule  in  which  the 
advertiser  combines  any  of  the  above  schedules  with 
a low  level  of  advertising,  usually  a maintenance 
level.  For  example,  see  panel  (e)  . 
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I will  use  the  above  terminology  to  refer  to  the  various 
media  schedules. 

Media  Scheduling  Strategies 


Optimal  Advertising  Policies 

Sasieni  (1989)  considers  a very  general  class  of 
dynamic  response  functions  of  the  type 

dA 

— = g(u,  A) 
dt 

with  the  following  properties  for  the  response  function 
g(u,A)  : 

* 

i.  Response  is  an  increasing  function  of  advertising, 
i.e.,  dq/dn  > 0. 

ii.  Response  is  a decreasing  function  of  awareness, 
i.e.,  dq/dA  < 0 . 

iii.  Rate  of  decrease  with  respect  to  awareness  never 
decreases,  i.e.,  52g/3A2  < 0. 

iv.  There  is  a steady  state  level  of  awareness  for  any 
amount  of  advertising,  i.e.  for  all  u > 0,  there 
exists  A(u)  such  that  g(u,  A(u))=0. 

v.  Rate  of  increase  of  response  with  respect  to 

advertising  decreases  as  awareness  increases,  i.e., 
d2g/du5A  < 0 . 

The  model  specified  by  g ( . , . ) is  quite  flexible  and 
nests  important  special  cases,  as  indicated  below. 


Spending 
Rate  u(t) 


Panel  (a) 
Blitz  Schedule 


Ublitz 


Spending 


Panel  (b) 


Spending  Panel  (c) 

Rate  u(t)  a 3-pulse  Schedule 


Spending 
Rate  u(t) 


Panel  (d) 
Even  Schedule 


Ueven 


T 


Spending  Panel  (e) 

Rate  u(t)  Pulsing  Maintenance  Schedule 
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(a)  Vidale-Wolfe  Model  (1957)  in  which  g (u.  A)  = (m-A)  ru- 
bA,  where  m,  r,  b are  constant  parameters. 

(b)  Nerlove-Arrow  Model  (1962)  in  which  g(u,A)  = u-5A 
where  5 is  the  forgetting  rate. 

(c)  Mahajan  and  Muller  (1986)  in  which  g(u,A)  = (l- 
A)/(U)-8A. 

As  noted  by  Sasieni  (1989),  the  five  properties  of 
g(u,A)  does  not  capture  the  phenomenon  of  hysteresis 3 as 
described  by  H.  Simon  (1982)  based  on  Haley's  (1978)  data. 

Nerlove  and  Arrow  (1962)  studied  the  optimal  policy 
u*(t)  for  their  model  in  (a).  Gould  (1970)  extended  Nerlove- 
Arrow'  s analysis  to  include  nonlinear  costs  of  advertising. 
Srinivasan  (1974)  and  Sethi  (1977)  develop  the  optimal 
policy  for  Nerlove-Arrow' s model.  Sethi  (1973)  characterized 
the  optimal  advertising  policy  for  Vidale-Wolfe' s model  in 
(b) . Sethi  (1971)  compared  pulsing  versus  continuous 
spending  policies.  Mahajan  and  Muller  (1986)  formally 
defined  the  terms  for  several  policies  and  then  compared 
their  performance  for  the  model  formulation  in  (c)  in  which 
/( u)  is  S-shaped  or  concave. 


3 The  term  hysteresis  in  this  context  means  that  the  time 
path  traced  by  the  state  variable  awareness  A(t)  when 
advertising  is  increased  is  not  the  same  as  its  path  traced 
back  when  advertising  is  decreased.  Note  that  Simon  (1982) 
and  Mesak  (1992)  refer  to  this  phenomenon  as  'advertising 
wearout . ' 
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This  stream  of  research  asserts  that  chattering  is 
optimal  when  g(u, .)  is  convex  or  S-shaped,  while  an  even 
schedule  performs  the  best  when  g(u, .)  is  linear  or  concave. 

Next  I explain  the  intuition  for  the  chattering  policy 
when  g(u,  .)  is  S-shaped,  then  present  the  monotonicity 
theorem  which  rules  out  pulsing  schedules  for  concave 
g(u,  . ) . 

Chattering  schedule 

Consider  a S-shaped  function  as  shown  in  figure  2-2. 
This  function  is,  for  example,  /( u)  in  Mahajan  and  Muller's 
formulation  described  earlier. 


Figure  2-2.  Optimal  spending  under  S-shaped  response. 


In  the  above  figure,  u is  the  point  of  inflection  which 
splits  the  curve  in  the  lower  convex  portion  (where  marginal 
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returns  to  advertising  is  increasing)  and  the  upper  concave 
portion  (where  diminishing  marginal  returns  sets  in) . 

Sasieni  (1971)  argues  that  when  an  advertiser  faces  a 
convex  response  or  due  to  budgetary  constraints  his  spending 
rate  falls  between  (0,  u) , the  advertiser  is  better  off  by 
alternating  his  spending  between  two  points  u = 0 (origin) 
and  u = u (tangency  point^) . By  alternating  infinitely 
often,  in  theory,  the  convex  part  of  f(u)  is  linearized  by 
the  tangent  line.  Hence,  the  response  achieved  under  such  a 
chattering  spending  strategy  is  fi  which  is  always  greater 
than  f2,  the  response  due  to  spending  at  some  steady  rate  au 
, for  any  a,  0 < a < 1. 

Further,  when  finite  frequency  pulsing  is  used,  shorter 
cycle  times  are  better  than  longer  ones  (see  Theorem  2, 
Sasieni  1989) . That  is,  equivalently,  more  number  of  pulses 
in  a finite  planning  horizon  are  desirable.  Mahajan  and 
Muller  show  that  as  few  as  3-4  pulses  can  achieve  90%  of  the 
maximum  theoretical  response  due  to  the  infinite  frequency 
chattering  policy. 

An  even  schedule 

That,  for  concave  or  linear  g(u, .),  pulsing  can  never 
be  optimal  for  previously  described  models  (e.g.  Vidale- 
Wolfe,  Nerlove-Arrow,  Mahajan-Muller)  is  asserted  by  the 
following  theorem,  due  to  Hartl  (1987) . 


4 The  tangency  point  u is  a positive  real-valued  root  of  the 
following  equation:  df/dx  = f(x)/x. 
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Monotonicity  Theorem.  Let  state  A:[0,oo)— >•  R and 
control  u:[0,oo)->  Rn.  Assume  that  functions  F: RxRn  — » R, 
g:RxRn  — > R,  and  c:RxRn  ->  R are  continuous  in  control  u 
and  continuously  differentiable  in  state  A.  Consider 
the  following  control  problem: 


Maximize  1 e rt  F(u,  A)  dt 

u(t)  o 

subject  to 


dA 

dt 


= g(u.  A)  , 


c(u,  A)  > 0, 


A(0)  = A0  given. 


Denote  u*(t)  and  A*(t)  as  the  optimal  solutions  of  the 
autonomous  one  state  variable  problem. 

Then,  if  A*(t)  is  unique  for  all  t > 0,  A*  (t)  is 
monotonic  for  all  t > 0. 


Note  that  the  monotonicity  of  A*  (t)  induces  monotonicity  of 
u*(t),  due  to  the  differential  equation  A = g(u.  A)  . 


Interpretation.  The  monotonicity  theorem  can  be 
interpreted  as  follows  (Hahn  and  Hyun  1991,  p.159).  The 
control  variable  u(t)  can  be  the  spending  rate  such  as 
GRPs/week.  The  state  variable  A(t)  represents  the  effects  of 
advertising  such  as  generation  of  awareness  (Mahajan  and 
Muller  1986)  or  building  up  the  stock  of  goodwill  (Nerlove- 
Arrow  1962)  or  sales  (Vidale-Wolf e 1957) . The  function 
F (u, A)  is  the  instantaneous  revenues  or  profit,  and  the 
factor  e”rt  is  discounting  it  to  the  present  value.  The 
function  g(u,A)  is  the  dynamic  response  model  under 
consideration  (e.g.  Nerlove-Arrow' s model),  while  the 
function  c(u,A)  represent  managerial  constraints  (e.g.  for 

fixed  budget  the  constraint  c(u.  A)  =>  J udt  = B ) . The 

0 
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advertiser's  goal  is  to  maximize  revenues  (or  profits) 
subject  to  the  dynamics  of  awareness  evolution  g(.,.)  and 
managerial  constraints  c(.,.). 

This  theorem  assures  that  for  models  with  a single 
state  variable,  such  as  awareness  A(t),  the  optimal  state 
trajectory  is  monotonic,  which  implies  that  the  optimal 
control  trajectory  is  also  monotonic  (also  see  Luhmer  et  al. 
1988,  Hahn  and  Hyun  1991) . This  property  of  the  optimal 
control  path  rules  out  pulsing  schedules  as  candidates  for 
the  optimal  solution  due  to  their  non-monotonic  nature  (e.g. 
on/off) . 

Thus,  the  above  studies  prescribe  either  even  or  the 
chattering  policy  as  superior  to  any  finite  frequency 
pulsing  schedule  for  a broad  class  of  dynamic  advertising 
response  models.  Next  I will  review  the  empirical  and 
theoretical  literature  that  identifies  conditions  when 
pulsing  schedules  can  be  superior  to  an  even  schedule. 

Empirical  Pulsing  Schedules 

Field  experiments 

In  1959,  Zielske  conducted  a field  experiment  to 
measure  recall  and  forgetting  of  advertisements.  Two 
randomly  selected  group  of  women  were  exposed  to  thirteen 
print  advertisement  messages  for  a food  ingredient  product. 
One  group  was  exposed  at  a frequency  of  once  every  week  for 
thirteen  weeks  and  nothing  for  the  rest  of  the  year  (a  blitz 
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schedule) . The  other  group  received  the  same  thirteen 
advertisements  at  a frequency  of  once  every  four  weeks  over 
the  year  (a  13-pulse  schedule) . Advertising  recall  was 
measured  in  both  groups  by  telephone  interview.  The  results 
of  Zielske's  classic  study  are  given  in  table  2-1  below. 


Table  2-1.  Empirical 

performance  of  media  schedules 

Schedule 

Cost 

$ 

% Recall3 

Nos.  Recall 
per  dollar13 

Blitz  (weekly) 

650 

21.0 

C\J 

• 

Pulsed  (monthly) 

650 

29.0 

5.8 

a.  Percentage  of  housewives  who  could  remember  the 
advertised  message,  averaged  over  52  weeks. 

b.  Number  of  housewives  who  could  remember  the  advertised 
message  per  dollar  of  advertising  cost,  averaged  over  52 
weeks . 


Using  total  awareness  as  a measure  of  performance,  it 
was  concluded  by  Zielske  (1959),  J.  Simon  (1979),  and 
Mahajan  and  Muller  (1986)  that  the  monthly  pulsed  schedule 
was  empirically  superior  to  the  (weekly)  blitz  schedule. 

This  is  consistent  with  previous  theoretical  results  (for 
example,  more  number  of  pulses  are  desirable,  Mahajan  and 
Muller  1986) . 

Ten  years  later,  a similar  field  experiment  with  three 
scheduling  conditions  (weekly,  biweekly,  and  monthly)  was 
conducted  by  Strong  (1972) . Based  on  his  empirical  analysis, 
Strong  (1974,  p.377)  concluded  that  "...schedules  with 
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flights  should  be  considered  as  practical 

alternatives ...  that  a schedule  with  flights  obtains  greater 
average  annual  recall  in  the  audience  than  an  even  spaced 
schedule ..." 

The  insights  from  field  studies  later  led  to  the  search 
for  an  answer  to  the  key  question:  Which  Scheduling  Option 
is  the  Best?  An  empirical  study  by  Zielske  and  Henry  (1980) 
addresses  this  issue. 

Zielske  and  Henry  (1980) : An  econometric  study 

In  1980,  Hubert  Zielkse,  as  national  director  for 
research  projects  at  Foote,  Cone  & Belding,  was  primarily 
interested  in  determining  the  best  scheduling  option.  He 
conducted  an  extensive  empirical  analysis  by  relating 
unaided  recall  (awareness)  to  gross  rating  point  (GRPs) . The 
details  of  the  study  are  given  in  Zielske  and  Henry  (1980) . 

Awareness  measurements  were  obtained  from  seventeen 
tracking  studies  on  six  products  and  services,  which 
employed  a variety  of  flighting  patterns.  The  total  ratings 
(a  surrogate  for  media  budget)  across  product  categories 
were  between  1600  to  4000  GRPs,  which  were  allocated  in  a 
pulsing  pattern  with  flight  lengths  ranging  from  8 to  20 
weeks.  The  best  fitting  econometric  model  was  determined  as 

AAt  = At  - At_1  = 0.030  ut  - 0.092  At_1 

where  A denotes  percent  awareness,  u denotes  weekly  GRP,  and 
t is  a time  subscript. 
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Using  their  fitted  model,  Zielske  and  Henry  (1980) 
proceeded  to  determine  the  best  policy  by  trial  and  error. 
They  considered  the  following  five  media  plans  which 
allocates  a hypothetical  budget  of  1300  rating  points  over  a 
fifty-two  week  planning  horizon. 

1.  Plan  A.  Thirteen  consecutive  weeks  at  100  rating 
points  per  week.  ( Blitz  Schedule) 

2.  Plan  B.  Twenty-six  consecutive  weeks  at  50  rating 
points  per  week.  (Blitz  Schedule) 

3.  Plan  C.  Fifty-two  consecutive  weeks  at  25  rating 
points . (Even  Schedule) 

4.  Plan  D.  100  rating  points  at  four  week  interval. 
(13-pulse  Schedule) 

5.  Plan  E.  100  rating  points  per  week  for  the  first 
seven  weeks,  a nineteen-week  hiatus,  100  rating 
points  per  week  for  the  next  six  weeks,  and  nothing 
for  the  remaining  period.  (2-pulse  Schedule) 

The  awareness  generated  by  the  above  media  plans  is 

shown  in  figure  2-3  below.  They  concluded  that, 

...there  are  many  recall  patterns  that  can  be  achieved 
within  the  same  budget.  ...some  patterns  will  be  more 
productive  than  others.  ...but  it  will  not,  in  itself, 
answer  the  question:  Which  scheduling  option  is  the 
best? 


This  conclusion  is  intriguing  because  the  best 
scheduling  option  can  be  determined.  It  is  the  one  which  has 
the  largest  area  under  these  awareness  curves  in  figure  2-3. 
This  can  be  easily  obtained,  as  given  in  table  2-2. 
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Table  2-2.  Performance  of  various  media  plans 


Media  Schedule 

Plan  A 

Plan  B 

Plan  C 

Plan  D 

Plan  E 

Total 

Awareness3 

424.45 

425.13 

432.01 

430.88 

426.2 

a.  Total  awareness  is  computed  by  adding  up  awareness  in 
each  period  as  well  as  the  terminal  value,  i.e. 


The  above  computations  show  that  the  best  schedule  is 
Plan  C,  i.e.  the  even  schedule.  The  results  are  consistent 
with  Zielske's  (1959)  finding  that  Plan  D is  superior  to  a 
blitz  schedule  in  Plan  A. 
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This  is  as  it  should  be,  since  their  best  fitting  model 
is,  indeed,  a discrete  version  of  Nerlove-Arrow' s model.  It 
seems  as  if  the  practitioners  somehow  feel  uncomfortable 
with  this  conclusion,  as  conjectured  by  Sasieni:  "...  it 
might  be  hard  to  persuade  the  marketing  VP  ...  this  [an  even 
schedule]  will  maximize  his  returns."  (Sasieni  1989,  p. 

368)  . 

Summary.  Several  empirical  studies  attempted  to 
determine  the  best  pulsing  schedule,  but  without  much 
success.  Besides  Zielske  and  Henry's  (1980)  study.  Strong 
(1974)  designed  a 'schedule  simulator,'  called  SKEDADDLE, 
with  the  hope  of  finding  the  best  pulsing  schedule.  But, 
unfortunately,  even  this  simulator  cannot  achieve  its  goal 
in  the  light  of  Hartl's  (1987)  monotonicity  theorem. 

Hence,  the  practice  of  pulsing  as  well  as  the  attempts 
to  find  the  best  pulsing  schedule  still  continues.  For 
example,  the  advertising  agency  J.  Walter  Thompson  Co.  is 
currently  testing  a media  planning  software,  named  SESAME, ^ 
whose  performance  and  optimization  characteristics  are 
unknown  (Malkin  1993) . 


5 An  acronym  for  System  of  Evaluating,  Setting  & Allocating 
Media  Expenditure.  Ms.  Lilia  Barroso,  media  director  for 
Latin  America,  says  that  "...  with  Sesame,  you  have  more 
flexibility  in  how  you  spread  your  advertising  throughout 
the  year  (Malkin  1993,  p.I-14)." 
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Theoretical  Pulsing  Literature 


In  this  section  I will  review  some  important  models 
which  identify  conditions  under  which  pulsing  is  superior  to 
an  even  schedule. 

Naslund  (1979) 

This  is  the  first  study  which  identifies  the  condition 
for  cyclic  advertising  policy.  Using  Nicosia's  (1966) 
consumer  behavior  theory  as  the  basis,  Naslund  formulates  a 
dynamic  model  with  three  state  variables:  sales  Xi(t), 
motivation  x2(t),  and  attitude  x3(t).  The  formal  model  is 
stated  below. 


Max  I (px,  - u)  e rtdt 

u(t)  o 

dx, 

— = b [x3  - pxj 

dx2 

— — = a [Xj  - otx2]  + c u(t) 

dt 

x3(t)  = m x2(t) 

The  equation  (1)  expresses  that  the  goal  is  to  find 
some  optimal  spending  pattern,  u*(t),  which  maximizes  the 
present  value  of  profit  stream,  subject  to  the  evolution 
specified  in  (2) -(4).  The  equation  (4)  relates  motivation 
and  attitude  linearly  and  therefore,  without  loss  of 
generality,  x3(t)  can  be  ignored.  That  is,  substitute  x3  = 
mx2  in  (2)  and  solve  the  two-state,  one-control  problem. 


(1) 

(2) 

(3) 

(4) 


The  dynamics  of  attitude  in  (3)  are  critical.  Assume 
that  the  parameter  a is  negative.  Equation  (3)  can  then  be 
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interpreted  as  follows.  The  increase  in  attitude,  dx2/dt,  is 
driven  by  the  current  level  of  attitude  itself  x2(t)  as  well 
as  the  spending  rate  u(t).  Substantive  implication  is  that 
the  currently  favorable  attitudes  and  sustained  advertising 
builds  positive  attitudes  toward  to  brand  in  the  future. 
Further,  the  magnitude  of  attitude  change  is  large  for  new 
brands  (with  low  level  of  sales  Xi(t))  and  small  for  the 
established  ones  (with  large  Xi(t)). 

Naslund  (1979)  solves  the  formal  optimal  control 
problem  specified  in  (l)-(4).  A cyclic  solution  for  the 
optimal  policy  u*(t)  is  obtained  when  the  roots  of  the 
characteristic  equation  are  complex  (see,  e.g.,  Kamien  and 
Schwartz  1991),  which  implies  the  following  parametric 
condition: 6 

| 4bma | > (aa-pb)2  (5) 

When  (5)  holds,  the  co-state  variable6 7  ^,2(t)  associated 
with  the  attitude  state  is  cyclic.  For  example,  the 
parameters  constellation  p = b = m = 1,  a = -1,  a = P = ^, 
and  r = 5%  yields 

„ 1600  T/„  V3  V3 

X2(t)  = {1  - e 7 - cos  — x - 10V3  sin  — x}  (6) 

2 1204  2 2 

where  x = t-T.  Assuming  a planning  horizon  T = 52  weeks,  the 

cyclic  X2(t)  is  shown  in  figure  2-4.  The  optimal  advertising 


6 Technically,  this  condition  is  obtained  when  the 
discriminant  is  negative  so  that  the  roots  of  the  quadratic 
are  complex. 

7A1so  known  as  adjoint  variable  or  'shadow  price' . See 
Kamien  and  Schwartz  1990. 
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policy  is  to  advertise  when  X2(t)  > , and  to  not 

c(t) 

advertise  otherwise.  (Note  that  c(t)  is  the  time-varying  ad 
response  coefficient  in  equation  3.) 


Figure  2-4.  Optimal  pulsing  schedule  in  which  advertising  is 
on  when  the  cyclic  curve,  A,2(t),  is  above  the  dotted  curve, 
l/c(t),  and  it's  off  otherwise. 


But  Muller  (1983)  argues  that  the  condition  in  (5)  is 
unlikely  on  logical  grounds.  Both  Naslund  (1979)  and  Muller 
(1983)  do  not  provide  any  empirical  support  to  their 
respective  claims.  Empirical  research  to  resolve  this  issue 
is  sorely  needed  because  Naslund' s model,  in  theory,  can 
address  several  unresolved  theoretical  concerns  raised  by 
Simon  (1982)  or  Mesak  (1992) . 
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ADPULS  (H.  Simon  1982) 

The  key  feature  of  this  model  is  the  asymmetric 
response  to  increase  and  decrease  in  advertising  spending, 
i.e.  there  is  incremental  gain  when  spending  level  is 
stepped  up  but  not  as  much  loss  when  it  is  stepped  down. 

Simon  (1982)  assumes  the  following  dynamic  asymmetric 
response  function. 

qt  = a + b + c Ln(ut)  + d max{0,  (ut  - ut-1)  } (7) 

' V ' 

gain/loss  term 

He  justifies  this  form  based  Helson' s (1964)  adaptation 
level  theory.  He  argues  that  market  response  depends  not 
only  on  the  current  level  of  advertising  but  also  on  the 
difference  between  the  current  advertising  level  and  the 
level  of  advertising  the  market  is  adapted  to.  This 
adaptation  level,  according  to  Helson  (1964),  is  the 
geometric  mean  of  previous  ad  spending  levels.  For 
simplicity,  however,  Simon  assumes  the  adaptation  level  to 
be  the  last  period  ad  spending  itself.  Hence,  the  term  (ut- 
ut-i)  in  the  response  function. 

It  may  be  noted  that  when  spending  is  increased,  the 
term  Aut  = ut  -ut-i  becomes  positive,  and  there  is  sales  gain 
of  dAut.  However,  when  spending  is  decreased,  the  term  Aut 
is  negative,  so  the  maximum  between  {0,Aut}  is  zero,  and 
hence  the  gain/loss  term  is  zero,  i.e.  there  is  no  penalty 
to  lowering  advertising  level. 
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Simon  (1982)  refers  to  this  type  of  asymmetric  response 
as  'advertising  wearout  phenomenon, ' although  Sasieni  (1989, 
p.360)  correctly  views  it  as  hysteresis  (see  footnote  3). 

Under  asymmetry  of  benefits  to  switching  between  the 
spending  levels,  it  makes  sense  to  increase  spending  in 
small  steps  and  decrease  in  one  large  step  and  continue  this 
pattern  repeatedly,  rather  than  spending  evenly.  Simon 
(1982)  supports  this  intuition  with  analysis  and  empirical 
data . 

Thompson  (1981),  Luhmer  et  al.(1988),  and  Mesak  (1992) 
study  the  continuous  time  analog  of  Simon's  (1982)  model  and 
essentially  reach  the  same  conclusion.  Next,  I will  briefly 
describe  Luhmer  et  al.  (1988)  so  that  it  is  apparent  that 
Simon's  (1982)  formulation  also  involves  a multiple  state, 
single  control  problem;  hence,  Hartl's  (1987)  theorem,  which 
applies  to  one  dimensional  autonomous  control  problem,  is 
not  applicable. 

Luhmer,  Stendl,  Feichtinger,  Hartl,  Sorger  (1988) 

The  continuous  time  analog  for  Simon's  (1982)  model 
involves  the  formulation  of  the  adaptation  level,  which  is 
really  a 'state-variable.'  Adaptation  level  is  an 
exponential  smoothing  of  previous  advertising  efforts  and  it 
is  modeled  by  the  differential  equation: ^ 

8 The  logic  of  this  differential  equation  becomes 
transparent  in  the  discretized  version:  St+i-St  = y (At-St)  => 

St+i  = YAt+(l-y)St.  The  right  hand  side  of  the  last  expression 
is  a weighted  average  of  current  advertising  effort  At  and 
the  current  adaptation  level  St. 
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(8) 


where  S(t)  denotes  the  adaptation  level,  and  A(t)  is  the 
advertising  effort. 

Luhmer  et  al . (1988)  extend  Simon's  formulation  by 

treating  advertising  effort  A(t)  as  goodwill  rather  than  the 
direct  intensity  of  ad  spending  u(t).  Following  Nerlove  and 
Arrow  (1962),  then. 


where  A(t)  is  goodwill  and  u(t)  is  the  advertising  spending 
rate  (e.g.  dollars/week) . 

It  is  important  to  understand  why  Luhmer  et  al . require 
'goodwill'  as  an  additional  state  variable  in  their  model, 
although  Simon's  ADPULS  doesn't  have  it.  In  Simon's  discrete 
time  ADPULS  model,  it  is  profitable  to  step-up  and  step-down 
spending  levels  due  to  the  nature  of  asymmetric  response. 

But  there  is  no  behavioral  reason  as  to  why  this  switching 
cannot  be  carried  out  infinitely  often.  Indeed,  in  the 
continuous  time  version,  the  infinite  frequency  pulsing  is 
optimal.  But  this  is  only  a theoretical  policy  called 
chattering,  not  pulsing  as  desired.  In  order  to  make  a 
finite  frequency  pulsing  as  optimal,  some  inertia  is 
required  on  the  motion  of  the  chattering  policy  u(t).  Such 
inertia  is  induced  by  an  exponentially  averaging  filter  — 
the  advertising  goodwill  formulation  in  (9). 

Simon's  response  function  in  (7)  is  now  modeled  as 


dA 

= u(t)  - aA 

dt 


(9) 


follows . 
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— = b In  (1  + A)  - 5q  + w(A  — S)  (10) 

dt 

where  q(t)  is  the  sales  at  time  t,  A(t)  is  goodwill,  and 
S(t)  is  the  adaptation  level.  Note  that  (1+A)  term  is  used, 
instead  of  simply  A (or  ut  as  in  eqn.  7),  to  ensure  that 
logarithmic  term  does  not  explode  at  A = 0. 

The  most  important  feature,  i.e.,  the  asymmetry,  is 
modeled  by  defining  a kink  function 

w(A  - S)  : = max  {0,  w(A  - S)  } (11) 

where  w is  a positive  constant.  The  form  in  (11)  captures 
the  intuition  that  only  when  goodwill  A(t)  is  above  the 
adaptation  level  S(t)  that  there  is  a positive  gain,  while 
if  goodwill  is  below  the  adaptation  level  then  there  is  zero 
gain. 

The  formal  control  problem  is  solved  to  maximize  the 
discounted  profit  stream,  that  is. 

Maximize  I e~rt(7cq  - cu)  dt 

u(t)  o 

subject  to  the  dynamics  specified  by  (8)  — (11)  . Here, 
constant  r is  the  discount  rate,  n is  the  price  per  unit 
sold,  and  c is  the  unit  cost  of  advertising. 

The  model  contains  three  state  variables:  sales, 
goodwill,  and  adaptation  level.  Its  analytical  solution 
being  difficult,  a numerical  solution  is  presented  (Luhmer 
et  al.,  p.  174).  There  is  no  empirical  support  for  any  of 
the  following. 


1.  The  asymmetric  shape  of  the  response  function, 
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2.  The  use  of  the  individual  level  construct  of 
adaptation  level  to  aggregate  market  response, 

3.  The  optimal  pulsing  schedule  that  can  be 
practically  implemented. 

Hopefully,  future  research  will  fill  this  void. 

Mesak  (1985),  (1992),  Mesak  and  Darrat  (1992) 

Both  Mesak  (1985)  and  Mesak  and  Darrat  (1992)  formulate 
a single-state  variable  control  problem,  and  hence  Hartl's 
theorem  applies.  Consistent  with  the  previous  literature 
(e.g.  Sasieni  1971),  the  results  suggest  that  an  even 
schedule  is  superior  for  concave  response  functions,  and 
chattering  is  superior  under  convex  or  S-shaped  response 
functions.  Mesak  and  Darrat  (1992)  determine  the  shape  of 
the  response  function  for  Lydia  Pinkham' s data  (Palda  1964). 
The  response  is  a concave  function  under  annual  and  monthly 
data  intervals,  as  expected  (see  an  extensive  review  by  J. 
Simon  and  Arndt  1980) . 

Mesak  (1992)  combines  Vidale-Wolfe  dynamics  and  Simon's 
asymmetric  function  to  derive  a new  aggregate  market 
response  function.  He  then  investigates  the  role  of  discount 
rate  in  scheduling  decisions,  in  which  the  pulsing 
superiority  is  driven  by  the  asymmetry  (see  Simon  1982) . The 
contribution  is  to  enhance  the  understanding  of  managerial 
behavior  under  impatience  or  short-term  profit  orientation. 

The  results  indicate  that  pulsing  is  superior  to  an 
even  schedule  for  zero  discount  rate.  Pulsing  is  superior  to 
an  even  schedule  for  non-zero  discount  rate  as  well, 
provided  the  initial  sales  are  close  to  equilibrium  sales 
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under  an  even  schedule.  If  initial  sales  are  not  close  to 
the  equilibrium,  then  pulsing  is  superior  only  when  discount 
rate  is  small,  while  an  even  schedule  is  superior  for  large 
discount  rates.  There  is  no  empirical  validation  of  these 
logically  deduced  propositions. 

An  interesting  testable  implication  of  his  model  is 
that  firms  with  short-term  orientation  (large  discount  rate) 
with  recently  launched  new  products  (sales  far  from 
equilibrium)  are  more  likely  to  employ  even  schedules.  A 
survey  based  empirical  study  can  verify  it  in  the  future. 
Hahn  and  Hyun  1991 

Their  model  introduces  the  concept  of  'pulsing  cost' 
which  is  incurred  only  when  media  advertising  is  switched 
on,  and  some  'fixed  costs'  incurred  to  keep  the  advertising 
on.  These  costs  are  in  addition  to  the  conventional  variable 
costs  of  media,  which  are  proportional  to  the  GRPs  bought. 

The  formal  control  problem  is  stated  as  follows: 


Maximize  J e rtS(u,  A)dt  - 

u(t)  o 


subject  to 


dt 

S(u,  A)  = B(A)  - W(u)  , 


dA 

— — = pd  — A)u  - 8a  , 


W(u)  = F a(u)  + C(u) , 

0 < u < u , 

A > 0,  A(0)  = A0 


The  interpretation  of  the  above  problem  is  as  follows. 
The  goal  of  the  advertiser  is  to  determine  the  best  spending 
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pattern  u*(t)  that  maximizes  the  discounted  stream  of 
profits  over  the  entire  period  [0,oo).  For  any  spending 
pattern  u(t),  there  is  an  associated  pattern  of  awareness 
A(t)  generated  by  the  assumed  Vidale-Wolfe  dynamics.  The 
profits  (except  for  the  pulsing  costs  P)  are  given  by  the 
function  S(u,A),  which  is  decomposed  as  benefits  B (A)  minus 
costs  W(u).  The  cost  W(u)  are  of  two  types:  fixed  cost  F and 
variable  costs  C(u).  The  fixed  cost  are  incurred  while 
advertising  is  on,  as  expressed  by  the  indicator  variable 
a(u)  which  equals  unity  when  ads  are  on,  and  zero 
otherwise.  The  pulsing  cost  P can  be  viewed  as  the  "start- 
up" cost.  It  is  incurred  only  at  discrete  points  of 
transition  when  advertising  is  switched  on  (as  indicated  by 
the  set  I in  the  summation) . The  spending  rate  (control)  is 
bounded  between  [0,  u],  awareness  is  always  non-negative, 
and  the  initial  awareness  level  Ao  is  assumed  as  given. 

The  analysis  of  the  above  problem  requires  the  use  of 
results  from  the  literature  on  optimal  oscillation  in 
control  models  (Sorger  1986,  Feichtinger  and  Sorger  1986) 
and  optimal  use  of  renewable  resources  (Lewis  and 
Schmalensee  1979,  1982) . Using  this  literature,  Hahn  and 
Hyun  (1991)  derive  new  results  which  can  be  summarized  in 
the  following  theorem. 

Theorem.  Assume  that  nonconvexity  exists  in  the  cost 
function  in  the  form  of  fixed  advertising  costs.  Assume 
further  that  there  exists  pulsing  costs  P associated 
with  each  pulse.  Then,  there  exists  P*  such  that 

a.  if  P > P*,  the  even  policy  is  optimal, 

b.  if  0 < P < P*,  the  pulsing  policy  is  optimal. 
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c.  if  P = 0,  the  chattering  policy  is  optimal. 

The  intuition  here  is  that  the  ratio  of  pulsing  to 
fixed  cost,  P/F,  determines  whether  pulsing  is  beneficial  or 
not.  (Here  P*  is  the  some  threshold  cost  of  pulsing 
associated  with  every  level  of  fixed  cost  F.)  When  pulsing 
costs  are  low  (P  < P*),  alternating  between  two  levels  u=0 
and  u=u  is  desirable  because  the  advertiser  incurs  pulsing 
cost  P and  fixed  cost  F when  ads  are  on  but  saves  on  the 
variable  cost  C(u)  when  there  is  no  advertising.  However,  if 
pulsing  costs  are  quite  high  then  the  advertiser  may  find  it 
profitable  to  not  alternate  at  all  and  maintain  an  even 
schedule.  Similarly,  if  there  are  no  pulsing  costs  at  all 
(P=0) , then  clearly,  since  more  frequent  pulsing  is 
desirable  to  less,  chattering  can  be  employed  profitably. 
Their  above  theorem  generalizes  Sasieni's  (1971,  1989) 
results . 

It  is  important  to  understand  why  Hartl's  theorem  does 
not  conflict  with  Hahn  and  Hyun' s results.  This  is  because 
of  the  presentation  of  the  formal  control  problem  appears 
like  a single  state  problem,  although  it  is  really  a two- 
state  problem.  The  second  state  is  a 0-1  variable  for 
absence-presence  of  pulsing  costs  at  switching  transitions 
in  the  summation  set  I (Hahn  and  Hyun  1991,  p.160) . 

Again,  no  empirical  analysis  is  provided.  Hahn  and  Hyun 
(1991,  p.166)  do  recognize  the  need  "...to  develop  a 
computational  procedure  of  the  optimal  periodic  pulsing 
...using  numerical  approach  suggested  by  Evans  et  al. 
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(1987)..."  However,  it  is  not  clear  how  to  determine  fixed 
and  pulsing  costs,  although  variable  costs  can  be  equated  to 
media  prices,  in  order  to  implement  their  best  periodic 
policy  for  some  real  brand. 

Feinberg  (1992) 

This  model  passes  a chattering  policy  through  an 
exponentially  averaging  filter  to  induce  inertia  on  its 
motion,  thus  forcing  optimality  on  some  finite-frequency 
pulsing  policy.  This  mathematical  trick  was  employed  by 
Luhmer  et  al . (1988),  in  their  use  of  goodwill  state 

variable  (see  equation  9) . 

Feinberg  (1992)  solves  the  following  problem 
numerically. 

Maximize  — ] [A(t)  - u(t)  ] dt 

u(t)  x o 

dA 

— = pf(z)  a - A)  - 5A 
dz 

— = T]u(t)  - r\z 
dt 

where  z(t)  is  the  filtered  variable  for  input  u(t).  (Compare 
equation  9 in  Luhmer  et  al.  1988.)  The  S-shaped  function  was 
assumed  as  f(x)=  x“/ (y+xp)  . The  following  specific 
constellation  of  parameters  was  assumed:  p=4,  5=1,  y=33/44, 
a=2.5,  and  P=2 . 

A policy  u(t)  is  defined  by  parameters  {u,  x,  p)  where 
u is  the  spending  rate  when  ads  are  on,  x is  the  length  of 
the  period,  and  p is  the  fraction  of  the  planning  horizon 
when  advertising  is  on. 
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Numerically  it  was  found  that  a periodic  policy,  which 
spends  about  45%-55%  of  the  planning  horizon  at  nonzero 
spending  level  and  the  remaining  time  with  zero  spending,  is 
superior  to  both  chattering  or  even  schedules.  Such  was  the 
case  only  for  large  values  of  r\  (greater  than  6)  . 

It  may  be  noted  that  there  is  no  behavioral 
interpretation  of  the  filtered  variable  z (i.e.,  is  it 
goodwill?  adaptation  level?  How  is  it  any  different?),  the 
required  S-shaped  f (u)  is  not  prevalent  in  practice  (J. 

Simon  and  Arndt  1980) , and  the  results  are  specific  to  the 
assumed  constellation  of  parameters,  which  are  not  estimated 
for  some  real  brand. 

Competitive  models 

Employing  a discrete  time  Lanchester-type  framework  for 
advertising  competition  (see,  e.g.,  Erickson  1985),  Park  and 
Hahn  (1991)  show  that  pulsing  is  superior  to  even  spending 
even  if  the  response  function  is  concave.  The  importance  of 
this  result  is  due  to  the  following.  All  previous  models  so 
far  have  asserted  that  an  even  schedule  is  the  best  policy 
in  the  presence  of  diminishing  marginal  returns.  But 
practice  prefers  pulsing  "universally"  (Jones  1995) . This  is 
the  first  study  which  identifies  a key  factor  that  makes 
pulsing  superior  to  an  even  schedules:  competition.  However, 
their  conclusion  needs  to  be  supported  by  empirical 


evidence . 
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Villas-Boas  (1993)  shows  that,  given  that  firms  use 
pulsing  schedules,  it  is  beneficial  for  firms  to  pulse  'out- 
of-phase'  to  maximize  industry  profits.  In  other  words, 
firms  should  mutually  alternate  advertising  spending  among 
themselves.  Such  a collusive  behavior  also  constitutes  a 
Markov-perfect  equilibrium  of  the  infinite  horizon  game.  He 
also  provides  empirical  evidence  across  several  product 
categories . 

A summary  of  pulsing  literature 

The  managerial  need  for  planning  better  pulsing 
schedules  has  provided  impetus  to  investigate  the 
comparative  performance  of  various  media  schedules  (Strong 
1977,  Zielske  and  Henry  1980) . This  stream  of  research  has 
reached  the  following  conclusions. 

First,  the  response  to  advertising  effort  is 
characterized  by  the  law  of  diminishing  marginal  returns, 
i.e.  concave  shape  (J.  Simon  and  Arndt  1980).  While  pulsing 
is  most  preferred  schedule  (Jones  1995),  there  is  no  clear 
planning  guidelines  available  (Corkindale  and  Newall  1975, 
Zielske  and  Henry  1980,  Bly  1993) . Practice  still  continues 
to  employ  ad  hoc  pulsing  schedules  (rather  than  even)  and 
seeks  guidance  on  the  planning  issue  (Kreshel  et  al.  1985, 
Lancaster  et  al.  1986,  Malkin  1993). 

On  the  theoretical  side,  Sasieni's  (1971,  1989) 
brilliant  analysis  closed  the  chapter  on  pulsing  schedules 
by  showing  that  for  a broad  class  of  response  functions  an 
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even  schedule  is  the  best  allocation  policy.  The 
monotonicity  theorem  due  to  Hartl  (1987)  complements  his 
view.  This  intrigued  marketing  scholars  as  to  why  practice 
prefers  pulsing.  The  following  reasons  were  identified. 

Convexity  and  asymmetry  of  response  functions  drive 
chattering  as  the  optimal  policy  (in  continuous  time 
models) . Averaging  filters  can  be  employed  to  induce  lag  on 
the  motion  of  the  chattering  policy  in  order  to  force 
optimality  on  some  finite-frequency  pulsing  schedule  (e.g. 
Luhmer  et  al.  1988,  Feinberg  1992) . 

Competitive  forces  reward  'out-of-phase'  pulsing 
behavior  (Villas-Boas  1993)  and  can  yield  pulsing  as  an 
optimal  schedule  in  the  presence  of  concave  response 
functions  (Park  and  Hahn  1991)  . 

Some  institutional  rigidities  in  media  buying  practices 
can  drive  the  preference  for  pulsing.  For  e.g.,  if  there  are 
start-up  cost  to  pulsing  which  are  small  relative  to  fixed 
costs  of  staying  on  all  the  time  (in  addition  to  the 
variable  costs  of  media  time  or  space) , then  pulsing  can  be 
beneficial  (Hahn  and  Hyun  1991) . Table  2-3  summarizes  the 
pulsing  literature  reviewed  above. 

While  these  explanations  are  elegant  and  satisfy 
intellectual  curiosity,  the  practice  of  advertising  still 
begs  for  guidance  on  planning  the  'best'  pulsing  schedule. 
This  maybe  partly  because  there  is  hardly  any  empirical 
support  for  the  model-based  analysis  (see  table  2-3)  or  any 
demonstration  of  model  implementation  in  practice. 


39 


Thus,  within  the  classical  framework  for  monopolists 
(e.g.,  Sasieni  1971,  1989,  Mahajan  and  Muller  1986),  so  far 
there  is  no  explanation  as  to  why,  or  indeed  should,  firms 
prefer  pulsing  to  an  even  schedule  in  the  presence  of 
diminishing  marginal  returns  to  advertising  efforts?  If  so, 
what  should  be  the  best  plan  of  pulsing  over  time? 

Advertising  Wearout,  Ad  Quality,  and  Overlearning 
Advertising  Wearout 

Advertising  wearout  is  of  concern  to  researchers  and 
managers  alike.  Several  excellent  reviews  are  available  on 
this  subject  (see  Greenberg  and  Suttoni  1973,  Kamins  1981, 
Pechman  and  Stewart  1990) . Hence  only  a brief  review  of  this 
literature  follows. 

What  is  ad  wearout? 

In  the  advertising  literature  (e.g.  Weilbacher  1970, 
Greenberg  and  Suttoni  1973,  Corkindale  and  Newall  1978, 
Axelrod  1980,  Kamins  1981,  Blair  1987,  Pechman  and  Stewart 
1990) , the  term  advertising  wearout  refers  to  the  decline  in 
the  quality  of  the  advertisements . A term  such  as  potential 
effectiveness  or  just  effectiveness  of  advertisements 
expresses  the  meaning  more  clearly,  although  I will  use  the 
term  'ad  quality'  to  be  consistent  with  this  literature. 


Table  2-3.  A summary  of  pulsing  literature 
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Table  2-3.  A summary  of  pulsing  literature  (continued). 
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In  a survey  of  advertisers  and  ad  agencies  in  the  U.K., 
a majority  of  respondents  think  of  advertising  wearout  as  a 
drop  in  the  quality  of  advertisement  due  to  boredom  with  the 
advertisement  (Corkindale  and  Newall  1978,  p.353). 

What  causes  wearout? 

Broadly  speaking  there  are  two  types  of  advertising 
wearout:  copy  wearout  and  repetition  wearout.  Copy  wearout 
is  because  of  the  passage  of  time  advertisements  get  dated, 
while  repetition  wearout  is  due  to  the  frequency  of  exposure 
which  results  in  irritation  or  inattention  or  boredom. 

Copy  Wearout.  There  are  several  ways  in  which 
advertisement  copy  wears  out  over  time.  For  example,  when 
the  advertisement  copy  is  new  it  provides  information  on 
search  attributes,  while  over  time  consumers  acquire 
experience  with  the  brand  and  the  impact  of  advertising 
dilutes  (Assmus,  Farley,  and  Lehmann  1984,  Lodish  et  al . 
1995)  . 

Sometimes  ad  copy  wears  out  because  advertising  style 
gets  imitated  over  time;  hence,  there  is  lower  perceived 
contrast  among  campaigns  (Groenhaug  et  al.  1991,  p.44).  In  a 
sketch  of  200  years  of  American  advertising,  Goodrum  and 
Dalrymple  (1990)  give  several  examples  for  ad  style 
imitation  or  'copy-catting.'  For  example,  in  just  a single 
month,  they  note,  several  brands  from  different  product 
categories  came  up  with  the  claim  "All  fiber  is  not  created 
equal"  {Metamucil) , "All  Calories  Are  Not  Created  Equal" 
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(Campbell's  Soup),  "All  Gold  Is  Not  Created  Equal"  (Visa), 

and  "All  cigarettes  are  not  created  equal"  (Kool) . Thus, 

copy-catting  intensifies  competition  for  consumers' 

attention  thereby  reducing  the  effectiveness  of  ads. 

Other  reasons  for  copy  wearout  follows:  novelty  of  the 

advertisement  declines  (Axelrod  1980) , persuasiveness  of  the 

message  may  diminish  (Blair  1987;  Conlin  1994),  popularity 

of  the  celebrity  in  the  advertisement  may  erode. 

Repetition  Wearout.  Consumer  behavior  literature 

suggests  that  repetitive  advertising  enhances  ad  wearout. 

For  example,  Pechman  and  Stewart  (1990,  p.14)  summarizing 

several  studies  on  advertising  repetition  note  that 

...  as  the  exposure  rate  increases  and  the  exposures 
become  increasingly  massed,  wearout  becomes 
increasingly  likely  even  under  ordinary  viewing 
conditions . More  specifically,  advertising  at  a high 
rate  may  be  no  more  effective  — or  even  less  effective 
— than  advertising  at  a low  rate  (Tellis  1988;  Batra 
and  Ray  1986) . (Emphasis  in  the  original) 

Some  other  related  explanations  for  repetition  wearout 

are  as  follows. 

Irritation.  One  view  is  that  wearout  sets  in  because  of 
irritation  or  tedium  (Berlyne  1970) . Greyser  (1973)  notes 
that  irritation  in  advertising  is  positively  related  to 
frequency  of  repetition,  intensity  of  spending,  similarity 
of  ad  executions  and  negatively  related  to  message  rotation 
(number  of  ad  copies  in  a campaign) . 

Inattention.  Alternatively,  advertisements  wear  out  due 
to  inattention  of  the  audience  to  the  repeated  exposure.  For 
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example,  in  a controlled  experiment  by  Craig,  Sternthal,  and 
Leavitt  (1976),  it  was  observed  that  brand  name  recall 
declined  significantly  when  exposures  exceeded  the  number 
needed  to  learn  brand  names.  When  attention  to  the 
advertisements  was  experimentally  induced,  the  wearout 
previously  observed  was  eliminated. 

A novel  explanation,  due  to  Cacioppo  and  Petty  (1979), 
is  that  increase  in  repetition  from  low  to  moderate  enhances 
agreement  with  message  advocacy,  whereas  additional 
exposures  result  in  a decline  in  agreement  because  negative 
thoughts  exceed  the  positive  ones. 

What  are  its  consequences? 

The  main  consequence  of  wearout  is  the  decline  in  the 
aggregate  brand  awareness.  Based  on  extensive  study  of 
consumer  response  to  advertising  (Bauer  and  Greyser  1968), 
Weilbacher  (1970)  raises  a thought-provoking  question:  What 
happens  to  advertisements  when  they  grow  up?  He  notes  that 
(p.  219-20) 

...advertisement  effectiveness  gradually  decays  or 
wanes.  At  some  point  in  time,  it  is  believed,  an 
advertisement  loses  its  sharp  cutting  edge.  ...  Is  it 
probable  that  each  advertisement  has  a life  cycle  all 
its  own  and  that  its  impact,  interest,  and  effect  vary 
over  its  life?  When  an  advertisement  is  fresh  and  new, 
it  must  have  a close  to  maximum  opportunity  to 
entertain  or  inform.  Fresh  information,  in  or  out  of  an 
appealing  context,  can  become  stale  fairly  quickly.  A 
gentle  amusing  exposition  loses  its  charm  after 
repeated  exposure.  ...the  individual  advertisement  is 
not  perceived  as  an  enduring  work  of  art,  but  rather  as 
a vehicle  of  communication  that  is  doomed  to  ultimate 
ineffectiveness  as  surely  as  the  butterfly  is  doomed  to 
die.  ...inevitable  death  in  the  world  of  consumers. 
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This  is  the  idea  of  the  life-cycle  of  advertisement  in 
the  presence  of  wearout. 

How  to  forestall  wearout? 

Broadly  speaking,  there  are  two  strategies  available  to 
the  advertisers:  media  scheduling  and  copy  execution. 

Media  scheduling.  Kamins  (1981)  reviews  the  literature 
that  addresses  the  scheduling  and  timing  aspects  of  this 
question  (p.  71) . Research  by  Grass  and  Wallace  (1969) 
suggest  that  increasing  the  variety  in  a campaign  allows  the 
advertiser  to  alternate  the  viewing  sequence  over  time, 
thereby  forestalling  ad  wearout.  However,  Kamins  points  out 
that  "this  research  leads  to  the  question  whether 
alternation  alone  can  cause  a forestalling  in  wearout,  or 
indeed  is  the  observed  effect  due  to  greater  time  period 
between  commercial  airings  ..."  (p.  70). 

Attempting  to  resolve  this  issue,  the  subsequent 
research  suggests  that  spacing  between  ads  is  crucial  in 
forestalling  wearout.  In  a controlled  experiment,  Calder  and 
Sternthal  (1980)  found  significant  effects  of  flight  length 
and  spacing  on  several  cognitive  and  affective  measures  for 
the  commercial  as  well  as  product  evaluations.  Even  in  field 
studies.  Grass  and  Wallace  (1969)  show  that  attention  is 
positively  related  to  the  time  span  between  commercial 
airings . 

Silence  is  golden.  The  insight  from  this  literature  is 
that  there  is  benefit  to  having  a gap  between  media  bursts: 
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spacing  enhances  attention.  This  inter-burst  spacing  not 

only  forestalls  wearout  but  also  can  restore  ad  quality.  As 

Grass  and  Wallace  (1969,  p.8)  observes 

Regeneration  of  attention  or  interest  level  is  possible 
after  commercials  have  passed  the  satiation  point  if 
they  are  removed  from  the  air 

Greenberg  and  Suttoni  (1973,  p.53)  hold  a similar  view: 

A commercial  that  is  running  for  a while  can  be  removed 
and  reintroduced  after  a time  and  take  on  a sense  of 
newness . 

Some  theoretical  views  and  empirical  evidence  from 
retail  price-promotions  context  also  suggests  that  inter- 
burst hiatus  can  be  beneficial.  Corkindale  and  Newall  (1978, 
p.334)  explains  that  ad  quality  regenerates  because  people 
forget  the  advertisement  messages  and,  hence,  the  greater 
the  forgetting,  the  more  the  enhancement.  There  is  also 
empirical  support  to  a similar  idea  that  spacing  between 
price  promotions  can  be  enhance  price  elasticity  for  certain 
class  of  products  (Achabal,  McIntyre,  Smith  1990;  Rao  and 
Thomas  1973) . In  other  words,  price  promotions  are  more 
effective  when  successive  promotions  are  scheduled  further 
apart  in  time. 

Executional  factors.  Several  ad  characteristics  can 
forestall  ad  wearout.  Research  shows  that  emotional 
campaigns  wearout  more  slowly  than  campaigns  that  attempt  to 
persuade  with  verbal  arguments  (Maclnnis  and  Price  1987). 
Longer  TV  commercials  (say  90")  wear  out  more  rapidly  than 
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the  regular  30"  ones  (Rethans,  Swasy,  and  Mark  1986,  Petty 
and  Cacioppo  1986) . 

For  print  advertisements,  Sawyer  (1973)  found  that  the 
use  of  two-sided  claims  (refutational  format)  produced 
greater  recall  than  one-sided  arguments  (supportive  format) . 
This  could  be  due  to  the  greater  complexity  of  refutational 
ads,  consistent  with  Berlyne  (1970) . Further,  the  wearout 
rate  was  lower  for  the  refutational  format  compared  to  the 
supportive  format. 

Thus  commercials  which  depend  on  gags  or  punch-lines 
seem  to  wear  out  quickly  as  the  commercial  becomes  familiar 
(Greenberg  and  Suttoni  1973) . Relatedly,  Ray  and  Sawyer 
(1971)  found  that  while  "grabber"  or  hard-sell  ads  produced 
better  recall  compared  to  the  soft-sell  ones,  their 
uniqueness  wears  out  with  repeated  exposures.  This  stream  of 
research  has  potential  to  help  produce  longer  lasting 
campaigns . 

Quality  of  Advertisements 
What  is  it?  A definition 

The  notion  of  advertising  quality  was  first  introduced 
by  Buzzell  (1964) . Along  similar  lines.  Parsons  and  Schultz 
(1976,  p.  85)  advocate  a notion  of  quality  adjustment 
factor,  denoted  pt/  which  captures  the  role  of  the  quality 
of  an  advertisement,  over  and  beyond  the  amount,  or 
quantity,  of  media  spending  (denoted  ut)  . Then,  they  view 
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advertising  impact  at  time  t as  the  product  of  the  quality 
and  quantity  factors,  namely,  ptut. 

In  this  dissertation,  ad  quality  will  be  defined  as  a 
time-varying  response  coefficient  for  the  effects  of  media 
weight  in  aggregate  response  models.  In  other  words,  the 
effects  of  media  spending  ut  is  moderated  by  the  quality  of 
advertisement  pt . As  mentioned  earlier,  a more  intuitively 
meaningful  term  for  ad  quality  is  'effectiveness'  or 
'potential  effectiveness'  although  I will  use  the  term  ad 
quality  in  this  dissertation  to  remain  consistent  with  this 
literature . 

Pekelman  and  Sethi  (1978)  incorporates  the  above 
construct  of  ad  quality,  as  declining  response  coefficient 
over  time,  in  a model  to  determine  when  an  advertisement 
copy  should  be  replaced.  Similarly,  Gross  (1972)  uses  the 
notion  of  the  distribution  of  quality  of  advertisement  to 
determine  the  number  of  ideas  that  should  be  considered  for 
screening. 

The  above  definition  is  consistent  with  Little's  (1979, 
p.644)  view  that  dynamic  models  of  advertising  response 
should  recognize  " ...  the  dollar  effectiveness  of 
advertising  can  change  over  time  as  the  result  of  changes  in 
media,  copy,  and  other  factors." 

Mahajan,  Muller  and  Sharma  (1984)  hold  a similar  view 
of  ad  quality  in  awareness  generation  models.  For  example, 
in  their  comparative  analysis  of  awareness  forecasting 
models,  they  say  that  (p.180) 
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. . . the  awareness  level  due  to  advertising  generated  in 
a given  time  period  depends  upon  the  amount  of 
advertising  dollars  and  the  quality  of  the  advertising 
campaign  . . . 

Finally,  Arnold  et  al.  (1987)  provide  strong  empirical 
support  to  this  formulation  of  ad  quality.  They  also  extend 
the  construct  of  ad  quality  by  letting  pt  to  be  a function 
of  attributes  of  ad  copy. 

Why  is  ad  quality  important 

Copy  management  is  an  important  aspect  of  advertising 
which  has  received  limited  attention  from  researchers  and 
practitioners  (Lodish  et  al.  1995).  Some  early  studies  (e.g. 
Buzzell  1964)  as  well  as  more  recent  ones  (e.g.  Eastlack  and 
Rao  1984,  1989;  Blair  1987;  Blair  and  Rosenberg  1994)  have 
repeatedly  emphasized  the  importance  of  advertisement  copy 
and  its  quality  relative  to  the  expenditure  or  'weight' 
related  factors. 

Lodish  et  al.  (1995)  emphasize  the  importance  of  copy 
effects.  The  field  studies  by  Adams  and  Blair  (1992)  at 
Campbell  Soup  also  show  that  the  market  share  points  are 
related  to  the  quality  of  the  ad  copy,  and  a 'fresh'  ad  copy 
results  in  gain  in  market  share.  Also  see  Eastlack  and  Rao 
(1989) . Similar  experience  is  reported  by  the  Goodyear  Tire 
and  Rubber  Company  for  their  successful  Aquatread  brand 
campaign  (Conlin  1994) . Bloom,  Jay  and  Twyman  (1977)  report 
the  case  of  Horlicks  brand  where  sales  increase  is 
attributed  to  the  change  in  ad  copy,  since  media  weight  was 
held  constant.  Hence,  it  is  important  to  explicitly 
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incorporate  the  role  of  time-varying  advertising  quality  in 
aggregate  response  models. 

Overlearning 

Besides  enhancing  wearout,  repetition  performs  an 
additional  role  which  is  usually  neglected  in  the  modeling 
literature.  Ad  repetition  results  in  more  learning  which 
reduces  the  likelihood  of  forgetting  (Johnson  and  Watkins 
1971).  This  is  because  repetition  serves  as  a reminder  and 
forestalls  forgetting  (Cromwell  and  Kunkel  1952;  Cook  and 
Insko  1968)  . 

Thus  overlearning  is  a phenomenon  in  which  forgetting 
rate  decreases  as  a function  of  cumulative  stock  of 
repetitive  efforts.  A good  example  of  overlearning  is  a 
pledge  of  allegiance  to  some  social  club  or  an  institution 
(e.g.  national  anthem) . 


CHAPTER  3 

A MODEL  OF  ADVERTISING  WEAROUT  AND  EMPIRICAL  SUPPORT 

Model  Development 

In  this  chapter  I formulate  the  ad  wearout  model  and 
provide  empirical  support  for  the  formulation.  The  first 
half  is  devoted  to  model  development,  and  the  next  half  for 
empirical  analysis.  The  model  development  proceeds  in  four 
sections.  First,  I model  evolution  of  brand  awareness  in 
response  to  advertising.  Then  I model  the  advertising 
wearout  phenomenon  and  incorporate  the  role  of  media  hiatus 
in  ad  quality  evolution.  Then  I embed  ad  quality  dynamics  in 
awareness  evolution  model.  Finally,  I extend  the  model  to 
show  how  to  incorporate  overlearning  phenomenon.  The 
objective  here  is  to  build  a model  that  can  yield  pulsing 
pattern  of  spending  as  an  optimal  policy  and  to  show  that 
such  a model  describes  empirical  data  on  brand  awareness 
reasonably  well  compared  to  several  other  awareness 
evolution  models  in  the  literature. 

Awareness  Formation 

Mahajan,  Muller,  and  Sharma  (1984)  review  several 
awareness  formation  models,  which  are  essentially  similar  to 
the  classic  ones  due  to  Nerlove  and  Arrow  (1962)  and  Vidale- 
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Wolfe  (1957) . Consistent  with  that  literature,  I model 
awareness  formation  by  the  differential  equation 

dA 

— = q(t)  g(u)  - 5 A ( 1 ) 

dt 

where  A(t)  is  the  awareness  at  time  t,  q(t)  is  time-varying 
response  coefficient,  5 is  the  forgetting  rate,  and  u(t)  is 
the  plan  of  spending  media  dollars  over  time. 

This  model  reduces  to  the  classical  Nerlove-Arrow' s 
(1962)  model,  when  the  response  coefficient  q(t)  is  constant 
over  time,  i.e.  q(t)  = P for  all  t and  when  response 
function  is  linear,  i.e.  g(u)  = u. 

Equation  (2)  says  that  the  increase  in  awareness  over  a 
small  time  period,  dA/dt,  is  due  to  advertising  spending 
u(t),  and  its  decreases  is  due  to  forgetting,  which  is 
proportional  to  the  level  of  awareness  (see  Mahajan,  Muller, 
and  Sharma  1984  for  a similar  interpretation) . The  term 
q(t)u(t)  can  be  interpreted  as  advertising  impact, 
consistent  with  the  previous  research  (see,  e.g.,  Parsons 
and  Schultz  1976,  p.85).  Within  the  context  of  this 
literature,  q(t)  is  interpreted  as  a 'quality  adjustment 
factor'  or  simply  advertising  quality  (see  literature 
review;  Buzzell  1964,  Little  1975,  Parsons  and  Schultz  1976, 
Pekelman  and  Sethi  1978,  Arnold  et  al.  1987) . 

Advertising  Wearout 

Little  (1979,  p.644)  notes  that  dynamic  models  of 
advertising  response  should  recognize  that  "the  dollar 
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effectiveness  of  advertising  can  change  over  time  as  the 
result  of  changes  in  media,  copy,  and  other  factors." 
Consistent  with  this  view,  I explicitly  specify  the  dynamics 
for  advertising  quality  q(t)  . 

It  is  empirically  observed  that  the  quality  of 
advertisement  can  decline  over  time  (for  e.g.  Parsons  1975; 
Pekelman  and  Tse  1976,  1980;  Arora  1979;  Erickson  and 
Montgomery  1979) . That  is,  dq/dt  < 0 for  some  time  period. 
The  deterioration  in  ad  quality  will  be  termed  advertising 
wearout,  consistent  with  the  advertising  literature  (see 
chapter  2;  Grass  1968,  Greenberg  and  Suttoni  1973,  Pekelman 
and  Tse  1976,  Pekelman  and  Sethi  1978,  Corkindale  and  Newall 
1978,  Axelrod  1980,  Pechman  and  Stewart  1990) . Hence,  when 
advertising  is  on,  the  ad  quality  evolution  is  modeled  by 
the  differential  equation 

— = - a (u)  q ( 2 ) 

dt 

where  a(u)  is  the  rate  at  which  advertisements  wear  out. 

The  rate  a(u)  will  be  assumed  linear  in  the  spending 
rate,  i.e.  a(u)  = c + w u(t).  As  discussed  in  the  literature 
review,  advertising  wearout  has  two  components:  copy  wearout 
and  repetition  wearout.  Here  c can  be  interpreted  as  a 
parameter  for  copy  wearout  because  it  intrinsically  depends 
on  the  characteristics  of  the  advertisements  regardless  of 
the  frequency,  repetition,  or  media  spending;  that  is,  a(0) 

= c.  The  parameter  w can  be  interpreted  as  repetition 
wearout  because  it  captures  the  role  of  frequency  or 


54 


spending  which  enhances  advertising  wearout;  that  is,  da/du 
= w. 

In  this  discussion,  q(t)  is  the  quality  of  the 
advertisement  message  or  theme,  consistent  with  the 
behavioral  literature  (e.g.  Grass  and  Wallace  1969, 

Greenberg  and  Suttoni  1973,  Kamins  1981,  Blair  1987,  Pechman 
and  Stewart  1990)  as  well  as  the  modeling  literature  (e.g. 
Gross  1972,  Pekelman  and  Sethi  1978)  reviewed  earlier.  It  is 
possible  to  extend  the  present  formulation  to  incorporate 
the  quality  of  individual  ad  executions,  i.e.  variations  on 
the  same  theme. -*- 


Ad  Quality  Evolution  During  Media  Hiatus 


Several  advertising  researchers  have  noted  that  ad 
quality  can  restore,  regenerate,  or  take  on  a sense  of 
newness  when  re-introduced  after  a hiatus  in  media  spending 
(see  Grass  and  Wallace  1969,  Greenberg  and  Suttoni  1973) . 


1 Suppose  there  are  several  ad  variants  of  a given  theme, 
e.g.  a pool  of  commercials  for  a campaign.  Let  qit  be  the 
quality  of  the  individual  ad  variant  i = execution  A, 
execution  B,  execution  C,  etc.  Denote  Dlt  as  a dummy  variable 
that  takes  value  of  one  when  ad  execution  i is  telecast  and 
zero  otherwise.  Then  the  overall  qt  = E Ditqlt  . The  quality 

evolution  for  each  ad  variant  can  be  specified,  in  general, 
by  the  difference  equation  Aqit  = h(qit,  qjt,  ult,  ujt)  for  all 
i.  Here,  subscripts  i,  j denotes  different  ad  variants,  and 
uit  and  Ujt  are  their  respective  spending  schedules.  Note 
that  this  equation  says  that  the  quality  of  a variant 
depends  on  its  current  quality  and  spending  patterns  as  well 
as  those  of  other  ad  executions.  Although  I abstract  away 
from  these  details  here,  such  analyses  will  have 
implications  for  copy  rotation  plan  within  a given 
advertising  burst. 
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Corkindale  and  Newall  (1978,  p.  334)  suggest  that  the 
greater  the  forgetting,  the  more  the  enhancement  of  ad 
quality.  Thus,  we  can  model  the  change  in  ad  quality  by  the 
following  differential  equation: 
dq 

— = -a(u)q  + (1  - I(u)  ) 8 (1  - q)  ( 3 ) 

dt 

where  I (u)  is  a variable  which  indicates  whether  advertising 
is  on  (I ( u ) =1  when  u * 0)  or  off  (l(u)=0  when  u = 0) . 

When  advertising  is  on,  u * 0 which  implies  I(u)=l.  On 
substitution  I(u)=l  in  (3)  and  simplifying  it  can  be  seen 
that  equation  (3)  is  exactly  the  same  as  equation  (2)  — ad 
quality  wears  out  at  the  rate  a(u). 

When  advertising  is  off,  u = 0,  hence,  I (u)  = 0. 

Similar  substitution  and  simplification  will  yield  the 
following  evolution  for  ad  quality. 

— = -cq  + 8 (1  - q)  (3.1) 

dt 

In  equation  (3.1) , 

• copy  wearout  continues  since  it  is  a temporal 
phenomenon  which  does  not  depend  on  spending, 

• repetition  wearout  ceases  because  advertising  is 
off, 

• ad  quality  regenerates  at  the  rate  proportional  to 
the  forgetting  rate  8 

Whether  ad  quality  for  some  real  advertisement 
rejuvenates  or  not  is  an  empirical  issue,  which  can  be 
verified  using  awareness  tracking  data  (see  estimation 
results  later  in  this  chapter) . 
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Overlearning 


While  it  is  recognized  that  repetition  enhances 
wearout,  the  benefit  of  repetition  is  usually  ignored,  i.e., 
repetition  can  lower  forgetting.  This  phenomenon  is  termed 
overlearning  (Cromwell  and  Kunkel  1952,  Cook  and  Insko  1968, 
Johnson  and  Watkins  1971) . The  implication  is  that 
forgetting  rate  is  not  constant;  it  decreases  as  a function 
of  effective  advertising  efforts. 

These  ideas  can  be  mathematically  expressed  in  the 
following  equations. 


6(u  ) = d0e-diu 
du* 


dt 


= X(u  - u*) 


(4) 


where  u*  (t)  is  an  exponentially  smoothed  average  of  previous 
advertising  spending  u(t).  Parameters  d0,  di,  and  X are 
positive  constants. 

In  words,  equation  (4)  says  that  forgetting  rate  5 
decreases  with  effective  advertising  spending  u*;  that  is, 
dS/du * < 0.  Following  Little  (1975),  the  effective 
advertising  effort  is  obtained  by  passing  actual  spending 
u ( t ) through  the  averaging  filter2  described  by  the 
differential  equation  in  (4) . This  is  consistent  with  the 


2 Some  common  terms  can  get  cryptic  in  continuous  time 
because  there  is  no  such  thing  as  "the  last  period  t-1.  To 
understand  this  usage,  consider  the  equivalent  discrete 
version  of  the  differential  equation  in  (4)  is  u t~u  t-i  = 
Mut-u*t-i)  • 0n  rearranging,  u*t  = ^ut  + (l-A,)ut-i-  Note  that 
in  the  last  equation,  the  current  spending  ut  and  previous 
efforts  u*t-i  are  'averaged'  to  obtain  the  current  effective 
advertising  effort  u*t.  Hence,  the  term  'averaging  filter'. 
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pulsing  literature  (Little  1975,  Luhmer  et  al.  1988, 

Feinberg  1992).  Essentially,  u*(t)  can  be  interpreted  as  the 
"effective"  spending  rate,  as  does  Little  (1975) . 

Model  Summary 

The  growth  in  brand  awareness  depends  on  the  quality 
and  the  quantity  of  advertising.  The  quality  of  advertising 
moderates  the  growth  in  awareness  due  to  media  spending. 

Ad  quality  can  deteriorate  for  several  reasons.  This 
decline  in  ad  quality  is  termed  advertising  wearout.  Broadly 
speaking,  there  are  two  sources  of  ad  wearout:  copy  wearout 
and  repetition  wearout.  For  example,  copy  wearout  occurs 
when  advertisements  get  dated  due  to  the  passage  of  time, 
while  repetition  wearout  occurs  when  the  audience  gets  bored 
due  to  excessive  frequency. 

It  is  believed  that  ad  quality  can  also  restore  during 
media  hiatus  because  people  forget  and,  hence,  take  on  a 
sense  of  newness  when  re-introduced  again.  Both  rise  and 
fall  in  ad  quality  in  response  to  the  pattern  of  media 
spending  is  thus  captured  in  the  model. 

In  addition,  the  often  overlooked  benefit  of 
advertising  repetition  — overlearning  — is  also 
incorporated  in  the  model . 
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Empirical  Analysis 

Now  I will  test  the  plausibility  of  proposed  ad  wearout 
model  using  data  from  field  experiments  carried  out  by 
Robert  Grass  and  his  colleagues  at  E.I.  du  Pont  De  Nemours  & 
Co.  In  the  next  section,  I will  provide  a more  definitive 
empirical  support  to  the  model  formulation  and  test  its 
relative  superiority  using  awareness  tracking  data  on  cereal 
brand  advertising  in  U.K. 

Field  Experiment  Data 

Here  I will  briefly  describe  the  wearout  experiments 
conducted  by  Grass  (1968),  the  estimation  procedure,  and  the 
estimation  results. 

Description  of  wearout  experiments 

Grass  (1968)  conducted  several  advertising  experiments 
to  learn  about  wearout  characteristics  of  advertisements.  In 
one  such  study,  as  reported  by  Greenberg  and  Suttoni  (1973), 
he  subjected  matched  samples  of  consumers  to  continued 
exposure  to  a consumer  product  television  advertisement,  for 
the  three-month  experimental  period,  at  varying  exposure 
intensity:  low  (1-3  ads /month ) , medium  (4-6  ads/month),  and 
heavy  exposure  (7-12  ads/month) . A control  group,  who  did 
not  see  any  advertisements,  was  also  tracked  during  this 
period. 

The  dependent  measure  was  brand  awareness,  which  was 
operationalized  as  the  percentage  of  the  respondents  who 
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mentioned  the  brand  name  first.  Telephone  interviews  of 
verified  viewers  were  conducted  24  hours  after  exposure. 
Brand  awareness  was  measured  on  four  occasions,  once  at  the 
beginning  and  then  at  the  end  of  each  month,  for  each  of  the 
three  test  groups  as  well  as  the  control  group.  It  was 
observed  that  the  control  group  had  approximate  awareness 
level  of  2.9%  (maybe  due  to  non-advertising  marketing 
activities)  throughout  the  experimental  period.  A plot  of 
the  data  are  shown  in  Figure  3-1. 


Wearout  Curves 


Figure  3-1.  Awareness  curves  over  time  under  constant 
exposure  rate. 
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Estimation  approach 

The  estimation  equation  for  the  ad  wearout  model  under 
constant  exposure  rate,  derived  in  Appendix  E (see  equation 


E.3),  is  given  below, 

g(Ui)  [e-a(Ui,ti  - e-6ti] 

A(ti  , UjJ  = — 

6 - a(ui) 


+ Ane  8ti  + e 


!5) 


where  i is  a subscript  for  data  point  i = 1,  2,  ...,  N,  N = 

12,  and  e±  ~ N(0,a2)  for  all  i.  That  is,  I have  assumed  that 
there  is  no  heteroscadesticity  or  autocorrelation  to 
conserve  degrees  of  freedom.  Ao  is  the  initial  awareness 
level  at  time  t = 0,  which  will  be  estimated  as  a parameter. 
Consistent  with  the  theoretical  model,  I have  assumed  that 
wearout  rate  is  a linear  function  of  exposure  levels,  i.e. 
a (Ui)  = c + w Ui  . 

In  order  to  determine  the  shape  of  the  ad  response 
function  g(Ui),  consider  the  following  function: 


g(ui)  = (1  + e~a/Ui)P  - 1 (6) 

It  can  be  shown  that  g(ui)  is  concave  for  small  a,  large  P, 
and  it  is  S-shaped  for  large  a,  small  (3  (see  Forker  and  Ward 
1993,  p.163).  Also  note  that  g(0)  = 0 as  Ui  approaches  zero, 
as  required. ^ Finally,  forgetting  rate  5 is  assumed  constant 
(i.e.  no  overlearning)  consistent  with  the  previous 
literature . 

The  following  log-likelihood  function  was  maximized. 


3 This  requirement  is  essential  to  allow  awareness  to  decay 
exponentially  when  advertising  is  off.  That  is,  in  equation 
(1),  dA/dt  = -8a,  only  when  g(u)=u=0.  See  Mahajan  and  Muller 
(1986)  for  details. 
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N,  „ * 1 £ (A(xi)  - f(xi  , 9)  f 

^(6)  = - — log  (271  a ) - - S 2 

2 2 i=i  a 


(7) 


where  data  vector  x±  = [t±,  uj,  parameter  vector  0 = [c,  w, 
8,  a,  3]T,  and  f(x±,0)  is  the  deterministic  component  of  (5). 
The  parameter  vector  0 is  estimated  by  maximizing  the 
concentrated  likelihood  function,  since  the  variance  a2  is 
not  known,  and  the  estimate  for  the  variance  is  given  by 
a2(0*)  = e'e/N,  where  e is  a vector  of  prediction  error  (e  = 
A(x) -f (x,0*)  ) . The  maximization  was  carried  out  using  the 
Gauss  programming  language. 

Estimation  results 

The  results  of  the  maximum  likelihood  estimation  are 
shown  below  in  table  3-1. 


Table  3-1.  Parameters  estimates  for  experimental  data. 


Ad  Wearout  Model 

Nerlove-Arrow  Model 

Parameters 

Estimates 

Standard 

Error 

Estimates 

Standard 

Error 

copy 

wearout  c 

0.1758 

0.2449 

' 

repetition 
wearout  w 

-0.0097 

0.0339 

forgetting 
rate  5 

0.1245 

0.1646 

0.7506 

0.4208 

a 

0.2546 

0.1409 

0.2107 

0.0709 

3 

2.224 

0.2731 

3.01 

06781 

Initial 
Awareness  Ao 

1.552 

1.026 

0.3736 

2.9725 

Max.  Log- 
Likelihood 

-7 . 9063 

-15.6195 

R2 

94.49 

80.08 
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It  can  be  seen  from  table  3-1  that  the  parameters  of 
the  theoretical  model  can  be  estimated  from  data,  the  signs 
are  meaningful,  and  the  fit  of  ad  wearout  model  is  quite 
good  (R2=94 . 5% ) compared  to  the  alternative  Nerlove  and 
Arrow  (1962)  model. 

Based  on  the  likelihood  ratio  test,  the  statistic 
-2SJ.  * = 15.4264  is  greater  than  the  critical  %2  = 5.99  for 
0.05  significance  level  with  2 degrees  of  freedom.  Hence, 
with  two  additional  parameters  (copy  wearout  c,  and 
repetition  wearout  w ) , the  proposed  ad  wearout  model  fits 
the  data  in  figure  3-1  significantly  better  than  the  Nerlove 
and  Arrow's  model. 

The  intuition  for  the  superior  fit  of  the  proposed 
model  is  as  follows.  A non-zero  copy  wearout  parameter 
implies  a downward  bend  in  the  awareness  curves  over  time 
(see  theoretical  results  in  chapter  4) . Any  other 
comparative  model  that  ignores  copy  wearout  will  not  predict 
the  decline  in  awareness  because  of  monotonic  growth  in 
awareness  over  time  (see  Appendix  G) . This  enhances  the 
magnitude  of  prediction  errors  when  wearout  is  present  in 
the  data;  hence,  a poorer  fit  of  the  comparative  model. 

Consistent  with  the  bulk  of  the  previous  literature  (J. 
Simon  and  Arndt  1980),  the  response  function  g(u)  has  a 
concave  shape  when  a = 0.25  and  (3  = 2.22. 
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Bootstrapping  for  small  sample  inference 

Since  the  sample  consisting  of  twelve  data  points  is 
small,  the  statistical  inference  cannot  be  reliably  based 
on  the  results  from  asymptotic  theory  (e.g.  Rao  1973) . I 
have  used  bootstrapping  methods  (see  Efron  1979)  to 
characterize  the  empirical  distribution  of  parameters  (c,  w, 
5) . Statistical  inference  is  based  on  confidence  intervals 
constructed  from  the  empirical  distribution.  The  95% 
confidence  interval  for  these  parameters  is  obtained  from 
the  25th  smallest  and  the  25th  largest  value  in  1000 
replications.  The  method  is  outlined  in  Appendix  C. 

The  histogram  for  copy  wearout  parameter  is  shown  in 
figure  3-2,  for  repetition  wearout  parameter  in  figure  3-3, 
and  for  forgetting  rate  in  figure  3-4. 


parameter  is  not  normally  distributed. 


The  95%  confidence  interval  for  copy  wearout  is 
(0.0474,  0.7609).  Since  this  interval  does  not  contain  zero. 
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the  hypothesis  that  parameter  c = 0 is  rejected.  Indeed,  c 
was  never  zero  even  once  in  1000  replications. 

However,  the  repetition  wearout  parameter  is  likely  to 
be  zero  since  the  95%  confidence  interval  is  (-0.0793, 
0.0192)  which  includes  the  origin.  Indeed,  w was  estimated 
as  a negative  number  77%  of  the  bootstrapped  samples. 


F: 

wearout  is  close  to  normal  distribution. 


jtition 


Similarly,  the  sampling  distribution  of  the  forgetting 
rate  parameter  is  not  normally  distributed  and  the  95% 
confidence  interval  for  it  is  (0.0326,  0.5314).  Hence,  the 
hypothesis  that  5 = 0 is  rejected  at  a = 0.05  significance 
level.  In  fact,  the  smallest  value  of  the  forgetting  rate 
was  found  to  be  0.019. 

From  these  data  on  advertising  wearout,  we  can  now 
conclude  that  the  model  parameters  are  identified  and  can  be 
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estimated.  The  parameters  for  copy  wearout  and  forgetting 
rate  were  significant,  while  the  repetition  wearout 
parameter  was  not.  And,  the  shape  of  the  response  function 
was  concave. 


Distribution  of  Forgetting  Rate  8 


■ Frequency 


Figure  3-4.  Non-normal  empirical  histogram 
forgetting  rate  parameter. 
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Empirical  Support  and  Models  Comparison 

The  purpose  of  this  section  is  to  provide  a more 
definitive  support  for  the  ad  wearout  model  formulation  and 
compare  its  theoretical  and  statistical  performance  relative 
to  several  other  awareness  forecasting  models  in  the 
literature.  In  this  section,  I will  describe  the  awareness 
tracking  data  collected  by  Millward  Brown,  Inc.,  who 
monitored  brand  awareness  generated  in  the  population  due  to 
advertising  efforts  of  a major  cereal  brand  in  the  U.K.  Then 
I will  present  several  alternative  models  of  awareness 
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formation,  both  for  new  as  well  as  established  brands.  The 
comparative  analyses  here  will  complement  previous  efforts 
by  Mahajan,  Muller  and  Sharma  (1984),  who  restricted  their 
analyses  to  awareness  forecasting  of  new  products  only. 

Then,  I will  describe  the  estimation  method  using  the  state 
space  modeling  approach  or  Kalman  Filtering. 

Awareness  Tracking  Data 

The  data  consist  of  time  series  of  brand  awareness  and 
gross  rating  points  (GRPs)  for  a brand  of  cereal,  in  a given 
market  in  the  U.K.  Data  are  collected  by  Millward  Brown, 

Inc.  on  a weekly  basis.  It  was  referred  to  me  by  Prof.  Jeff 
Harrison  at  the  University  of  Warwick,  U.K.  The  details  of 
the  monitoring  method  and  the  data  set  are  provided  by  Brown 
(1975),  Donius  (1986),  West  and  Harrison  (1989).  In  the 
following  figures,  I show  the  weekly  brand  awareness  and  the 
pattern  of  GRPs  for  75  weeks,  which  constitute  the  data  for 
further  analyses. 

The  pattern  of  media  spending  over  time,  that  generated 
the  brand  awareness  (in  figure  3-5),  is  shown  in  figure  3-6. 
A single  advertisement  campaign  was  used  during  this  period. 
It  may  be  noted  that  the  expenditure  pattern  is  non- 
monotonic and  has  several  weeks  of  no  spending  at  all.  In 
particular,  note  the  long  media  hiatuses  during  weeks  8 
through  15  and  from  weeks  49  through  75,  and  a few  short 
gaps  in  between. 
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Brand  Awareness 


weeks 

Figure  3-5.  % respondents  aware  of  the  brand  of  cereal 

due  to  its  TV  advertising. 


Cereal  TV  Spending 


weeks 


Figure  3-6.  The  media  schedule  employed  to  generate  brand 
awareness . 
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Alternative  Awareness  Formation  Models 

The  following  seven  models  are  briefly  described  in 
this  section  because  I will  later  use  them  for  comparing  the 
goodness  of  fit  relative  to  the  proposed  ad  wearout  model. 

1.  Vidale  and  Wolfe  (1957) 

2.  Nerlove  and  Arrow  (1962) 

3.  Brandaid  (Little  1975) 

4.  Tracker  (Blattberg  and  Golanty  1978) 

5.  Litmus  (Blackburn  and  Clancy  1982) 

6.  News  (Pringle,  Wilson  and  Brody  1982) 

7.  Dodson  and  Muller  (1982) 

The  first  three  models  describe  the  dynamics  of  brand 
awareness  formation  for  established  brands,  while  the  next 
four  models  describe  awareness  formation  for  new  products 
(see  Mahajan,  Muller,  and  Sharma  1984) . The  details  of 
justifying  the  model  form  and  its  implications  are  skipped 
here  for  the  sake  of  brevity  and  the  reader  is  referred  to 
the  original  references. 

1,  Vidale  and  Wolfe  (1957) 

This  classic  model  describes  awareness  formation  as 
follows.  Over  a small  period  of  time,  increase  in  brand 
awareness  are  due  to  the  brand's  advertising  effort  which 
converts  the  unaware  segment  of  the  market,  while  attrition 
of  the  aware  segment  occurs  due  to  forgetting  of  the 
advertised  brand. 

Denote  A(t)  as  the  awareness  at  time  t,  u(t)  as  the 
spending  rate  (GRPs/week) , P as  the  ad  response  coefficient, 
and  8 as  the  forgetting  rate.  The  mathematical  model,  in 
continuous  time  representation,  is  given  below, 


69 


dA 

dt 


3(1  - A)u  - 6A 


(5) 


while  its  equivalent  discrete  time  representation4  can  be 
obtained  as  follows. 


At  - At_j  = p(l  - At_1)  - 8At-1 

At  = a - 3ut  - 5)  At_!  + 3 ut 

The  discrete  time  representation  in  (5.1)  is  important 
for  estimation  purposes,  since  data  are  discrete. 

2.  Nerlove  and  Arrow  (1962) 

This  model  states  that  the  growth  in  awareness  depends 
linearly  on  the  advertising  effort,  while  awareness  decays 
due  to  forgetting  of  the  advertised  brand.  Mathematically, 
the  model  is  given  by  (6) : 


— = 3 u - 8 A , (6) 

dt 

and  its  discrete  time  representation  is 

At  = (1  - 8)  At-1  + 3 ut  . (6.1) 

3.  Brandaid  (Little  1975) 

This  model  states  that  brand  awareness  in  the  current 
period  depends  partly  on  the  last  period  brand  awareness  and 
partly  on  the  response  to  advertising  effort.  The  response 
to  advertising  effort  can  be  linear,  concave,  or  S-shaped. 

Denote  the  response  to  advertising  by  a nonlinear 
function  g(u).  Then,  the  Brandaid  model  can  be  expressed 
mathematically  as  in  (7)  below. 


4I  will  denote  discrete  time  by  subscripts,  while  continuous 
time  is  denoted  in  parenthesis  as  a function,  e.g.  At  and 
A ( t ) . 


70 


At  = (xAt_,  + (1  - a)g(ut ) 


g(u)  = 


u' 


(7) 


(s: 


<J)  + UY 

4.  Tracker  (Blattberg  and  Golanty  1978) 

The  model  states  that  the  incremental  awareness  depends 
on  the  advertising  effort  which  acts  on  the  unaware  segment 
of  the  market.  Mathematically,  the  model  can  be  expressed  as 

At  - At_j  = a - ea  - puM  a - At_.) 

A.  = (ea  - PUt)At_,  + (1  - ea  - PuM 

5.  Litmus  (Blackburn  and  Clancy  1982) 

This  model  states  that  current  period  awareness  is  a 
linear  combination  of  the  maximum  awareness  and  the  last 
period  awareness.  The  response  to  the  advertising  effort 
acts  on  the  maximum  awareness.  Denote  the  potential  as  A 
and  the  response  to  advertising  effort  as  g(ut)  = 1-exp  (- 
Put) , then  the  model  can  be  expressed  as 


At  = A*tt-e-pu‘)  + e-|5u‘. 


H-l 


(9) 


6.  News  (Pringle,  Wilson  and  Brody  1982) 

The  model  states  that  total  awareness  is  a sum  of  three 
components:  ANt  (previously  unaware  consumers  who  become 
aware  due  to  the  response  to  advertising  effort) , AEt 
(previously  aware  consumers  who  retain  their  awareness  due 
to  current  advertising) , and  ARt  (previously  aware  consumers 
who  retain  their  awareness  regardless  of  the  advertising 
effort) . Mathematically,  the  model  is  expressed  as 
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At  — ANt  + AEt  + ARt 
ANt  = (A*  - At-1)  a - e~^Ut ) 
AEt  = (At_j  - A0)  a - e_PUt) 


(io; 


ARt  = kiA,.,!  — A0  — AEt)  + A0 

On  re-arranging  and  simplifying  the  algebra,  the  above 
model  can  be  written  as 

At  = A*g(ut)  + a - k)  a - g(ut)  )A0  + (1  - g(ut)  )kAt_x  (10.1) 

where  the  response  function  is  g (u) =l-exp (~Put)  , A*  denotes 
maximum  potential,  Ao  is  the  initial  awareness,  and  k is  a 
retention  factor  (which  is  equal  to  1-5) . 

7.  Dodson  and  Muller  (1978) 

This  model  makes  a meaningful  distinction  between  'new' 
products  and  the  established  ones  by  explicitly  introduces 
the  role  of  word-of-mouth  effects.  In  all  previous  'new' 
product  models,  initial  awareness  Ao  is  the  only  parameter 
that  captures  the  distinction  of  new  or  established  brands. 

However,  the  word-of-mouth  dynamics  introduces 
quadratic  term  (for  example,  the  standard  Bass  model),  as 
can  be  seen  below. 

At  - At_x  = g(ut)  (1  - At_x)  + bAt_xd  - At_x)  - a - k)At_x  (11) 

In  (11),  incremental  awareness  is  generated  by 
advertising  as  well  as  word-of-mouth,  both  of  which  act  on 
the  unaware  segment  of  the  market.  The  word-of-mouth  effect 
is  proportional  to  the  size  of  the  aware  segment.  The 
awareness  decays  due  to  forgetting  of  the  advertised  brand. 

Here,  g(ut)  is  the  response  to  advertising  effort  ut 
and  the  parameter  b captures  the  word-of-mouth  effects.  We 


72 


should  expect  b = 0 for  the  established  brands,  in  which 
case  the  model  dynamics  are  similar  to  Vidale-Wolfe' s model 
(see  equation  5) . The  retention  factor  k is  negatively 
related  to  the  forgetting  rate  (k  = 1-8) . 

On  simplifying  the  algebra, 

At  = [1  - g(ut)  + b - (1  - k)  ]At_x  - bA2t_x  + g(ut) 

which  can  be  expressed  as 

At  = + g(ut) 

where  4>(At_1)  = pAw  - b A^_1  (11.1) 

and  p = [k  + b - g(ut)  ] 

An  Unified  Estimation  Framework  for  Dynamic  Models 

The  parameters  of  dynamic  models  are  estimated  by 
maximizing  the  joint  likelihood  of  observing  the  sequence  of 
data  {Yi,  Y2,  ...,  Yt}  . The  important  point  that  may  be  noted 
here  is  that  I express  the  joint  density  as  product  of 
conditional  densities  of  observing  the  new  datum  Yt,  given 
all  available  information  so  far,  denoted  as  It-i  = {Yi,  Y2, 

. ..,  Yt-i } . The  details  are  given  in  Appendix  D. 

In  doing  so,  the  optimal  state  estimation  approach 
(described  in  appendix  D)  is  in  contrast  with  the 
statistical  methods,  for  example  the  ordinary  least  square 
method  as  used  by  Mahajan,  Muller,  and  Sharma  (1984),  where 
the  joint  likelihood  is  expressed  as  the  product  of  marginal 
densities.  Consequently,  the  standard  statistical  estimation 
methods  implicitly  assume  and  impose  independence  across 


time  periods,  even  though  the  model  is  dynamic  in 
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formulation  (i.e.  explicit  inter-temporal  dependence  is 
recognized) . This  results  in  disregarding  the  stochastic 
nature  of  lagged  endogenous  variables  as  well  as  for  the 
time  sequence  in  which  data  {Yi,  Y2/  . ..,  YT}  unfold.  A 

consequence  of  this  is  biased  parameters  and  wrong  standard 
errors  (see,  e.g.,  Hamilton  1994,  p.  209),  which  will  lead 
to  misleading  inferences. 

The  method  of  optimal  state  estimation5  using  the  state 
space  modeling  approach,  also  known  as  Kalman  filtering,  is 
explained  in  Appendix  D.  One  advantage  of  this  approach  is 
the  common  estimation  framework  that  can  be  utilized  for  all 
dynamic  models.  This  will  be  evident  from  table  3-2  which 
shows  that  all  models  discussed  above  can  be  expressed 
compactly  in  the  state  space  form  (see  appendix  D,  equations 
D.2  and  D.3) . 

Estimation  Results 

I programmed  the  subroutine  to  compute  the  sample 
likelihood,  which  nests  the  recursions  in  (D.6)  of  appendix 
D,  in  Gauss  programming  language.  The  maximization  of  the 
likelihood  function  in  (D.4,  appendix  D)  was  carried  out 
using  OPTMUM  version  3.1.3  in  Gauss.  The  data  set  used  for 

5 The  concern  here  is  to  infer  the  best  path  of  the  state 
variable  given  its  initial  value  and  known  disturbance 
characteristics  of  its  motion.  While  in  the  classical 
estimation,  the  concern  is  to  infer  the  best  parameters  of 
the  distribution  so  that  the  likelihood  of  observing  the 
given  data  is  the  highest.  See  Jazwinski  1970  or  Harvey  1994 
for  details. 
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estimation  was  shown  in  figures  3-5  and  3-6  earlier.  The 
starting  values  were  obtained  from  the  ordinary  least  square 
method.  The  hill  climbing  algorithm  throughout  the  run  was 
BFGS  (see  Gill  et  al.  1981  for  its  details) . Convergence  was 
achieved  fairly  quickly,  usually  taking  just  a few  minutes 
for  approximately  50-75  iterations. 

The  results  of  estimation  are  given  in  table  3-3  for 
models  1 through  3,  in  table  3-4  for  models  4 through  7,  and 
table  3-5  for  the  proposed  Ad  Wearout  Model. 

The  parameter  estimates  are  meaningful  as  well  as 
comparable  across  all  models.  For  example,  all  models 
indicate  initial  awareness  Ao  » 40%  which  is,  indeed,  close 
to  the  first  observation  in  the  data  set  (see  figure  3-5) . 
Similarly,  the  transition  uncertainty  («  2.5)  and 
measurement  noise  («  5.0)  are  similar  across  models.  It  is 
interesting  to  note  that  measurement  noise  is  the  largest 
source  of  uncertainty,  which  implies  a need  for  a larger 
sample  size  in  surveys  of  brand  awareness  levels  in  the 
population. 

Results  for  Mature  Brands  Models.  The  following 
discussion  is  based  on  table  3-3. 

Vidale-Wolfe  model  parameters  (3  and  6 are  both  zero  at 
95%  significance  level.  This  implies  that  Tt=l  and  ct=0  in 
table  3-2.  In  other  words,  awareness  evolution  follows  a 
random  walk,  i.e.  advertising  does  not  influence  awareness 
formation  for  this  brand,  according  to  this  model.  A similar 
conclusion  can  be  reached  based  on  Little's  Brandaid  model. 
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The  smoothing  constants  a«  1,  which  implies  a negligible 
weight  on  response  to  advertising  g(u).  Hence,  awareness 
formation  is  almost  a random  walk,  according  to  this  model 
formulation. 

Further,  this  data  suggest  that  ad  response  function 
g(u)=  uV  (<j>+uT)  is  almost  always  concave.  This  follows  from 
the  estimated  value  of  y = 1.09  and  <))  = 0.95  for  which  the 
d2g/du.2  < 0 for  all  non-negligible  spending  rate  (i.e.  for 
all  u > 0.0532  GRPs/weeks) . 

However,  unlike  Vidale-Wolfe  and  Brandaid  models,  a 
simple  Nerlove-Arrow  model  shows  that  advertising  effort  and 
forgetting  are  both  operative  in  building  awareness  for  this 
brand.  This  follows  from  significant  nonzero  parameter 
estimates  for  3 and  8 in  table  3-3.  Moreover,  this  model 
explains  the  data  better  than  both  Vidale-Wolfe  and 
Brandaid. 

The  comparative  model  fitness  follows  this  rank  order: 
Nerlove-Arrow  (best)  with  LL*  = -172.99,  Brandaid  (LL  =- 
180.83)  and  Vidale-Wolfe  (worst)  with  LL*  = -184.89. 

Results  for  New  Product  Models.  The  discussion  here  is 
based  on  results  in  table  3-4. 

The  parameter  estimates  for  the  Tracker  model,  a » 0 and  P » 
0,  implies  that  Tt=l  and  ct=0  in  table  3-2.  Hence,  awareness 
follows  a random  walk,  i.e.  advertising  spending  does  not 
affect  awareness  formation  for  this  brand. 


Table  3-2 . System  matrices  for  awareness  formation  models 
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Table  3-2.  System  matrices  for  awareness  formation  models  (continued). 
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Table  3-3.  Results  for  mature  product  models 
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A similar  conclusion  can  be  reached  based  on  News  model 
(since  P=0  and  k«l),  Litmus  (since  P=0  regardless  of  A*,  k, 

Ao) , and  Dodson  and  Muller  (since  P=0,  k=0,  b=0) . 

That  brand  awareness  follows  random  walk  is  a plausible 
statement  for  this  brand  of  cereal  with  its  own  image  and 
age.  Consequently,  we  should  not  expect  strong  word-of-mouth 
effects  to  influence  its  awareness  growth.  Indeed,  this  is 
borne  out  by  the  estimate  of  b « 0 in  Dodson  and  Muller' s 
model . 

The  fit  of  various  models  here,  based  on  maximized  log- 
likelihood  value,  is  about  the  same  and  is  significantly 
worse  than  the  simple  Nerlove  and  Arrow  (1962)  model. 

Ad  Wearout  Models.  The  following  discussion  is  based  on 
results  in  Table  3-5. 

The  estimated  parameters,  shown  in  table  3-5,  have 
expected  signs,  are  significant,  and  meaningful.  For 
example,  measurement  uncertainty  (a6  = 5.82)  is  about  the 
same  magnitude  as  indicated  by  the  other  seven  models. 
Similarly,  the  initial  awareness  level  (Ao  = 41.99)  is  about 
the  same  as  the  first  observation  in  the  data  (see  figure  3- 
5)  and  comparable  to  that  estimated  by  other  seven  models. 
The  forgetting  effect  was  significant,  as  in  all  other 


models . 


Table  3-4.  Results  for  new  product  models 
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Table  3-4.  Results  for  new  product  models  (continued) . 
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In  particular,  the  copy  wearout  was  also  highly 
significant  (c  = 0.1231  and  t = 21.58),  while  repetition 
wearout  was  not  (hence,  dropped  during  the  estimation) . 
Further,  the  model  fits  the  data  quite  well,  as  can  be  seen 
in  figure  3-7  below.  Based  on  the  log-likelihood  value  (=- 
172.32),  the  proposed  ad  wearout  model  fits  this  data  better 
than  most  other  benchmark  models,  except  for  Nerlove-Arrow' s 
model . 


Table  3-5.  Estimates  of  ad  wearout  model 
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It  is  evident  from  figure  3-6  that  the  actual  pattern 
of  spending  is  not  uniform,  but  instead  several  weeks  of 
intense  spending  is  alternated  with  media  hiatus.  However, 
neither  Nerlove  and  Arrow' s model  nor  any  other  model 
estimated  here  can  explain  why  this  firm  has  chosen  a 
pulsing,  rather  than  an  even  spending,  schedule.  This 
conclusion  follows  from  Hartl's  (1987)  monotonicity  theorem 
which  assures  us  that  all  these  single-state  variable  models 
can  only  have  a monotonic  optimal  media  schedule,  and  never 
an  on/off  pulsing  schedule.  But  this  is  not  what  we  observe 
in  practice,  as  can  be  seen  in  figure  3-6. 
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So  Why  Pulse?  To  understand  this  issue,  I show  (in 
figure  3-8)  that  data  supports  the  hypothesis  that  ad 
quality^  is  decreasing  when  media  is  on,  and  that  it 
restores  when  it  is  off.  This  figure  is  obtained  by  plotting 
the  evolution  of  ad  quality  using  model  equations  (2)  and 
(3.1)  and  the  estimated  parameters  in  table  3-5.  The  drop  in 
quality  of  advertisement  is  due  to  the  significant  copy 
wearout  parameter  (see  equation  2) , while  its  rise  is  due  to 
the  net  effect  of  copy  and  forgetting  effects  (see  equation 
3.1)  . 


Ad  Quality  Evolution 


ad  quality 


weeks 


Figure  3-8.  Empirical  ad  quality  evolution. 


6 Recall  that  ad  quality  is  simply  a time-varying  ad 
response  coefficient  P in  the  classical  Nerlove  and  Arrow 
Model . 
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Hence,  it  makes  sense  to  not  advertise  after  a while 
when  the  quality  gets  quite  low,  and  allow  a period  of 
hiatus  for  ad  quality  to  restore  before  starting  the 
campaign  again.  That  such  a schedule  of  on/off  spending 
media  budget  is  also  optimal  will  be  shown  in  chapter  5. 

Summary 

As  shown  in  the  above  analysis,  it  is  possible  to 
characterize  the  empirical  relationship  between  advertising 
quality  and  quantity  effects  on  brand  awareness,  using  data 
from  tracking  studies.  In  particular,  ad  quality  declines 
when  advertising  is  on,  and  it  restores  during  a media 
hiatus.  Unlike  other  extant  models,  this  rise  and  fall  of  ad 
quality  explains  why  firms  pulse  their  media  spending.  The 
model  formulation  provides  a meaningful  description  of 
awareness  and  spending  data.  It  also  fits  this  data  better 
than  most  other  models,  in  the  statistical  sense.  Hence,  the 
proposed  ad  wearout  model  is  theoretically  superior  to  the 
previous  formulations  in  the  literature  and  statistically 
superior  to  all  but  Nerlove  and  Arrow' s model  for  these  data 
on  cereal  brand  advertising. 


CHAPTER  4 

THEORETICAL  RESULTS  FOR  BLITZ  AND  TWO-PULSE  SCHEDULES 

Overview 

In  the  previous  chapter  I formulated  the  ad  wearout 
model  and  showed,  within  the  context  of  the  acquired  data, 
that  it  provides  an  empirically  valid  description  of 
advertising  wearout  phenomenon.  Here  I will  analyze  its 
theoretical  properties  for  two  special  cases  of  pulsing 
schedules.  In  the  first  half  of  this  chapter,  I will  examine 
a blitz  schedule,  in  which  resources  are  concentrated  at  the 
beginning  of  the  horizon,  to  understand  the  dynamics  of 
awareness,  the  life-cycle  idea  of  Weilbacher  (1970),  as  well 
as  the  key  issue  of  whether  resources  should  be  concentrated 
or  not  (Sasieni  1971,  1989) . In  the  second  half,  I will 
introduce  the  role  of  advertising  spending  hiatus  and 
analyze  two-pulse  schedules  to  observe  insights  into  the 
issues  of  length  and  spacing  of  advertising  bursts  and  the 
optimal  managerial  response  to  changes  in  the  magnitudes  of 
the  copy  and  repetition  wearout  coefficients.  Thus,  I will 
attempt  to  answer  some  of  the  open  questions  left  for  future 
research  by  Simon  (1982)  and  Mesak  (1992). 
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Theoretical  Blitz  Schedule 


Dynamics  of  Awareness  in  the  Presence  of  Ad  Wearout 

In  figure  4-1,  panel  (a),  I illustrate  a theoretical 
blitz  schedule  in  which  an  advertiser  expends  the  entire 
media  budget  at  the  beginning  of  the  horizon.  Consider  a 
media  budget  of  B available  to  an  advertiser  who  wishes  to 
spend  it  in  first  1 weeks  of  the  planning  horizon  of  T 
weeks.  Mathematically,  a blitz  schedule  is  defined  by  the 
following  step  function: 

u(t)  = u t e [0,  1) 

= 0 t e [I,  T] 

where  a constant  spending  rate  u = B/l  is  maintained  when 
advertising  is  on. 

Based  on  the  ad  wearout  model  discussed  in  chapter  3, 
the  evolution  of  brand  awareness,  in  response  to  the  blitz 
schedule  in  (1),  is  given  by  the  following  equations. 

A.(t)  = e U ~ (e”aliI)t  - e-6t)  + A0  e-5t  t e [0,  1)  (2.1) 

1 5 - a(u) 

A2(t)  = A^)  e-8(t~J1  t e [1,  T]  (2.2) 

The  derivation  of  the  above  result  is  given  in  Appendix 
E. 

Consider  an  advertiser  who  spends  all  100  monetary 
units  in  the  first  25  weeks,  resulting  in  a spending  rate 
u ( t ) =4  for  te  [0, 25) , and  spends  nothing  for  the  rest  of 
the  planning  horizon  of  52  weeks.  For  this  spending  plan. 
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aggregate  awareness  during  the  first  25  weeks  builds  up,  and 
then  gradually  decays  due  to  forgetting  in  the  absence  of 
advertising  from  week  25  through  52.  In  figure  4-1  below,  I 
show  the  evolution  of  awareness  given  by  (2. 1-2. 2)  for  a 
blitz  media  schedule  in  (1)  . 


A(t) 


u(t) 


Figure  4-1.  Awareness  evolution  in  the  presence  of  ad 
wearout  for  a blitz  schedule. 


Such  awareness  growth  and  decay  curves  are  similar  to 
those  obtained  experimentally  (e.g.  Zielske  1959,  Strong 
1974) . However,  there  is  an  important  difference  when 
awareness  evolves  in  the  presence  of  ad  wearout:  the  decline 
in  awareness  sets  in  even  when  advertising  is  on.  The 
following  result  explains  why  a bend  should  be  expected  in 
the  learning  curve. 

Result  1:  Inverted-U  awareness  evolution 

The  awareness  evolution  in  (2.1)  has  unique  and  global 
maximum  at  some  time  t , which  marks  the  onset  of 
wearout,  given  by  the  expression 
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, Ln  5 - Ln  a(u)  , , . 

t = (3) 

8 - a(u) 

A proof  of  this  result  is  given  in  Appendix  F. 

Note  that  simply  the  presence  of  ad  wearout  induces  a 
decline  in  the  awareness  after  some  time,  regardless  of  the 
magnitude  of  wearout  rate  a(u).  This  is  consistent  with 

Weilbacher's  (1970)  notion  of  the  life-cycle  of  an 
advertisement  (see  literature  review) . According  to 
Weilbacher  (1970),  advertising  has  the  maximum  potential  to 
inform  and  entertain  initially,  and  then  its  sharp  edge  is 
lost  later  on,  and  its  effectiveness  wanes  gradually.  Such  a 
decline  in  awareness  is  empirically  observed,  see  wearout 
curves  in  figure  3-1. 

It  is  important  to  understand  that  this  decline  in 
awareness  will  not  occur  if  ads  were  never  to  wearout,  that 
is,  a (u) =0 . This  is  because,  in  the  absence  of  wearout, 
brand  awareness  in  (2.1)  builds  up  monotonically1  (see 
Appendix  G) . Hence,  ad  wearout  drives  the  decline  in 
awareness.  Below,  in  figure  4-2,  I show  the  comparative 
evolution  of  brand  awareness  in  the  presence  and  absence  of 
ad  wearout. 

The  intuition  for  decline  in  awareness  is  as  follows. 
Initially,  when  an  advertisement  is  new,  ad  quality  is  high. 


1 In  technical  terms,  t*  in  (3)  approaches  infinity,  as 
wearout  rate  a — » 0. 
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advertising  impact^  dominates  the  forgetting  effect;  hence, 
awareness  grows. 


A(t) 

wearout  absent 


Figure  4-2.  Comparative  dynamics  of  brand  awareness  in  the 
presence  and  absence  of  ad  wearout. 


As  media  spending  continues,  ad  quality  declines  due  to 

wearout,  advertising  impact  goes  down,  forgetting  effect 

exceeds  advertising  impact;  hence,  awareness  declines.  (See 

ad  wearout  model  equation  1 in  chapter  3.) 

Result  2:  Onset  of  wearout 

The  onset  of  the  decline  in  the  awareness  can  be 
delayed  by  lowering  the  rate  at  which  advertisements 
wear  out. 


Proof.  See  appendix  H. 

This  prediction  is  empirically  true,  based  on  figure  3- 
1.  It  can  be  seen  that  the  onset  of  wearout  under  high 


2 See  glossary;  advertising  impact  is  the  quality-adjusted 
media  spending.  Technically,  it  is  the  term  q(t)u(t)  in  the 
awareness  model  equation  1 of  chapter  3,  consistent  with 
Parsons  and  Schultz  (1976). 
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exposure  rate  (7-12  ads/month)  is  earlier  compared  to  the 
onset  under  the  low  exposure  rate  (1-3  ads/month) . 

This  result  underscores  the  importance  of  managing 
advertising  wearout  by  using  spending  rate  as  a control 
variable,  since  ad  wearout  depends  on  the  pattern  of  media 
schedule,  i.e.  a(u)  is  a functional  of  u(t).  Alternatively, 
even  executional  factors  can  be  employed  to  delay  the  onset 
of  wearout.  For  example,  Anand  and  Sternthal  (1990)  show 
that  ad  message  can  be  designed  to  be  challenging,  i.e.  not 
easy-to-process , in  order  to  lower  the  repetition  wearout 
rate,  and  thus  delay  the  onset.  Just  as  in  the  results  of 
Ray  and  Sawyer  (1971)  on  refutational  ads,  one  appealing 
explanation  is  the  complexity  of  message. 

Should  Resources  Be  Concentrated  or  Spread  Evenly? 

The  awareness  dynamics  under  blitz  spending  policy  is 
given  by  (2. 1-2. 2).  Consistent  with  the  literature  (see, 
e.g.,  Mahajan  and  Muller  1986,  Sasieni  1989),  I will  obtain 
the  total  awareness  generated  by  media  budget  B for  a blitz 
schedule  of  length  1 by  evaluating  the  area  under  the 
awareness  curve  over  the  infinite  horizon  [0,oo)  . That  is, 
total  awareness  generated  by  any  blitz  schedule,  denoted 
R(l),  is  computed  by 

R(I)  = I A(t)dt 

0 

= f Aj  (t)dt  + I A2  (t)dt  + I A2  (t)dt 

0 1 T 


(4) 
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where  Ai(t),  given  by  (2.1),  is  the  awareness  evolution  when 
advertising  is  on,  and  A2(t)  in  (2.2)  is  the  decay  in 
awareness  when  advertising  is  off.  The  third  integral  can  be 
interpreted  as  the  'salvage  value'  of  the  terminal  awareness 
level.  The  following  intermediate  result  will  be  required 
later  to  demonstrate  the  superiority  of  a blitz  over  an  even 
schedule . 

Result  3:  Revenues  for  any  blitz  schedule 

Revenues  generated  by  a blitz  schedule  lasting  for  1 

weeks  is 


RU) 


g(u) 
8 a(u) 


(1  _ e-a(n)J)  + -r- 


An 


(5) 


where  Ao  is  the  initial  awareness  level. 

Revenues  under  an  even  schedule  is  obtained  when  a 
'blitz'  lasts  for  the  entire  planning  horizon,  i.e. 
letting  i = T in  (5) . 

See  Appendix  I for  derivation. 

In  what  follows,  let  g(u)  be  any  concave  response 
function,  in  general,  with  dg/du  > 0 and  d2g/du2  <0.  I 
assume  concavity  because  I want  to  show  that  a pulsing 
schedule  (such  as  blitz)  can  be  superior  to  uniform  spending 
schedule  even  in  the  presence  of  diminishing  returns  to 
advertising  effort.  The  importance  of  such  a demonstration 
is  driven  by  two  reasons.  First,  advertisers  commonly  face 
concave  response  to  advertising  spending  (J. Simon  and  Arndt 
1980) . Second,  pulsing  has  never  been  shown  to  be  superior 
to  even  schedule  in  such  an  environment,  in  theory,  within 
the  classical  framework  for  a monopolist  (e.g.  Sasieni  1989; 
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see  Park  and  Hahn  1991  for  competitive  effects) . In  deriving 
the  following  Results  4 and  5,  I assume  constant  forgetting 
rate  and  zero  initial  awareness,  similar  to  all  previous 
studies  in  this  literature  (see,  e.g.,  Mahajan  and  Muller 
1986,  Sasieni  1989) . 

Result  4:  When  is  an  even  schedule  superior? 

When  there  is  no  ad  wearout  (a  = 0) , the  optimal  1 
that  maximizes  total  awareness  R(l)  is  1 = T. 

See  appendix  J. 

This  result  shows  that  spreading  resources  uniformly 
over  the  entire  planning  horizon  is  the  best  policy,  when 
response  is  concave  and  ad  wearout  is  absent.  This  result  is 
essentially  the  same  robust  result  of  the  extant  literature 
(e.g.  Sasieni  1971,  1989,  Mahajan  and  Muller  1986).  However, 
the  following  result  5 identifies  a crossover  interaction 
when  ad  wearout  is  present. 

Result  5:  Blitz  can  be  superior  when  ads  wear  out 

In  the  presence  of  advertising  wearout  (a  * 0) , some 
blitz  schedules  are  superior  to  even  schedule  while 
others  are  not.  In  particular,  there  exists  one  ’ best 
blitz  schedule'  which  dominates  the  even  schedule. 

See  Appendix  K for  proof. 

This  is  the  main  result.  The  intuition  for  this  result 
is  as  follows.  Copy  and  repetition  wearout  are  two  opposing 
forces.  Due  to  copy  wearout,  the  quality  of  advertisement 
declines  over  time.  Therefore,  it  behooves  an  advertiser  to 
spend  media  budget  at  the  beginning  of  the  planning  horizon 
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when  quality  is  still  high,  rather  than  spread  it  over  the 
entire  horizon.  Now  suppose  the  firm  were  to  spend  all  its 
budget  in  the  first  nano-second,  since  the  quality  is  the 
highest  initially.  Clearly,  this  plan  would  bore  the 
audience  because  the  spending  intensity  will  be  too  high, 
resulting  in  excessive  repetition  wearout.  Hence,  media 
budget  needs  to  be  spread  out  to  counteract  repetition 
wearout,  but  it  needs  to  be  concentrated  to  counteract  copy 
wearout.  These  two  opposing  forces  drive  the  interior 
solution  for  optimal  duration  of  blitz,  1 . In  other  words, 
there  exists  a 'best'  blitz  schedule  which  avoids  both 
extremes,  1=0  (nano-second  plan)  and  1 = T (even 
schedule) . Hence,  media  resources  should  be  optimally 
concentrated  rather  than  spread  evenly  in  the  presence  of 
advertising  wearout. 

Next,  using  a numerical  example,  I will  illustrate  the 
above  results  (4)  and  (5)  in  more  concrete  terms  and 
identify  the  best  blitz  schedule. 

A numerical  example 

Let  concave  ad  response  function  be  g(u)  = log(l+u), 
as  in  Luhmer  et  al.(1988).  Consider  the  following  arbitrary 
parameter  values:  c = 0.1,  w = 0.01,  d = 0.001,  B = 100,  T 
52  weeks.  The  spending  rate  is  u = B/I  and  wearout  rate 
a(u)=  c + w u.  Using  equation  (5),  I compute  total 
awareness  R(l)  generated  by  blitz  schedule  of  duration  1. 
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The  total  awareness  as  a function  of  blitz  duration  is  shown 
in  figure  4-3. 


Total 


Inferior 
To  Even 

Figure  4-3.  Total  awareness  as  a function  of  duration  of 
blitz . 


It  can  be  seen  from  figure  4-3  that  blitz  schedules 
with  intense  concentration  (say,  less  than  six  weeks,  in 
this  example)  or  thinly  spread  out  ones  (e.g.  even  schedule) 
are  both  undesirable  (revenues  nearly  9000  units  or  less) . I 
have  marked  a region  in  the  center  which  shows  several  blitz 
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schedules  that  are  superior  to  the  even  schedule.  There  is 
also  a region  in  which  the  even  schedules  performs  better 
than  some  highly  concentrated  blitz  schedules  (left-hand 
region  in  figure  4-3)  . But,  more  importantly,  there  exists 
one  best  blitz  schedule  lasting  for  19  weeks  which  will 
generate  the  maximum  total  awareness  of  11359  units.  Compare 
this  plan  with  the  even  schedule  which  yields  a total 
awareness  of  9000  units,  in  this  example. 

As  can  be  seen  from  the  above  example,  even  for  the 
concave  response  log(l+u),  and  without  any  assumptions  of 
convexity  (Sasieni  1971,  1989,  Mahajan  and  Muller  1986), 
asymmetry  (Simon  1982,  Luhmer  et  al.  1988,  Feinberg  1992), 
or  competition  (Park  and  Hahn  1991),  a pulsing  schedule  of 
finite  duration  on/off  is  superior  to  the  even  schedule.  Of 
course,  I have  assumed  that  advertising  wears  out,  i.e. 
quality  of  ads  declines  at  a rate  a(u). 

A distinction  between  copy  and  repetition  wearout  matters 
Here  I will  characterize  the  optimal  managerial 
response  to  enhancing  copy  and  repetition  wearout.  Denote  a 
as  a fraction  of  the  planning  horizon3  under  the  advertising 
blitz,  i.e.,  a = J/T.  Then,  every  optimal  blitz  duration  1 * 
will  also  have  a corresponding  optimal  extent  of 
concentration  a*.  The  dependence  of  optimal  concentration  a 

3 This  merely  re-scales  the  planning  interval  of  [0,  T]  to  a 
new  unit  interval  [0,1].  Here,  a = 0.25  means  the  first 
quarter  from  January  to  March,  or  a = 0.75  means  the  first 
three  quarters  from  January  to  September,  etc.  See  Mahajan 
and  Muller  (1986)  for  a similar  scaling. 
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on  copy  and  repetition  wearout  is  shown  in  figures  4-4  and 
4-5  below. 


Copy  Wearout.  Continuing  with  the  previous  example, 
figure  4-4  shows  how  optimal  concentration  a varies  with 
copy  wearout  parameter  c.  When  copy  wearout  rate  increases, 
a front-loading  of  the  media  is  desirable,  i.e.,  as  c 
increases,  optimal  a decreases . This  follows  from  the  fact 
that  when  copy  wearout  is  high,  ads  become  dated  rapidly, 
and,  hence,  it  is  desirable  to  spend  media  budget  early  on 
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while  quality  is  still  high  enough.  For  example,  think  of  ad 
campaigns  for  announcement  of  a corporate  name  change  to  the 
general  public,  new  product  launch,  re-staging  of  old 
product,  limited  time  promotional  announcement,  going  out- 
of-business  sale,  etc.  Here  a quick  blitz  campaign  becomes 
desirable  because  information  from  these  ads  get  dated  soon. 

Repetition  Wearout.  In  figure  4-5,  it  can  be  seen  that 
when  repetition  wearout  increases,  an  advertiser  should 
spread  media  budget  over  a longer  time  frame,  i.e.,  as  w 
increases , optimal  a*  increases . This  spreading  out  of  media 
budget  lowers  the  spending  intensity  (i.e.  as  1 increases, 
u=B/l  decreases)  which  counteracts  the  high  value  of  w by 
lowering  the  ad  wearout  rate  a(u)=c+wu.  Hence,  managerial 
actions  are  qualitatively  different  when  copy  wearout 
increases  and  when  repetition  wearout  increases. 

Theoretical  Two-pulse  Schedules 

In  this  section  I will  extend  the  previous  analyses  to  the 
case  of  two-pulse  media  schedules.  The  two-pulse  schedules 
conceptually  differ  from  the  blitz  schedule  due  to  the 
presence  of  media  hiatus,  i.e.  a time  gap  between 
advertising  bursts. 
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Figure  4-5.  Optimal  duration  as  a function  of  repetition 
wearout . 


Below,  in  figure  4-6,  I illustrate  a two-pulse  media 
schedule  to  introduce  the  notations  for  the  duration  of 
bursts  (li  and  12)  and  the  inter-burst  spacing  (Xi  and  x2)  . 


u(t) 


t. 


weeks 


Figure  4-6. 


A two-pulse  media  schedule. 
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The  two-pulse  media  schedule  can  be  expressed 
mathematically  by  defining  an  indicator  variable  as  follows: 
I (t)  = 1 0 < t < li,  (li  + Ti)  < t < U1+T1+I2) 

= 0 otherwise. 

Noting  the  constraints  that  li  + Xi  + I2  + x2  = T and  liU  + I2U  = 
B (since  media  budget  B needs  to  be  expended  during  the 
planning  horizon  T) , we  formally  have  a two-pulse  schedule 
as  u(t)=  u I ( t ) . 

The  dynamics  of  ad  quality  is  given  by  the  model 
equation  (3)  in  the  previous  chapter.  The  rise  and  fall  in 
quality  for  a particular  two  pulse  media  schedule  as  defined 
above  is  stated  in  the  following  result. 

Result  6:  Ad  quality  dynamics 

The  decline  in  ad  quality  when  advertising  is  on  (i.e. 
I(t)  = 1)  and  the  rise  in  ad  quality  during  media  hiatus 
(i.e.  I (t)  = 0)  is  given  by  the  following  expressions: 

q(t)  = e~amt  t g [0,  /,) 

= --(--  q,)  e~p{t~h)  t e (/,,  /,  + x,) 

P P 

= q2  e'a(G)  !t“^)  t g (/,  + x„  /,  + x,  + l2) 

= - - (-  - q 3)  t G (A  + x,  + /2,  T) 

P P 

where  p=c+5,  qi,  q2,  and  q3  are  given  in  appendix  L. 

Proof:  See  Appendix  L. 

Ad  quality  evolution  under  a two-pulse  media  schedule 
is  illustrated  in  figure  4-7. 


q ( t ) 


t,  weeks 


Figure  4-7. 
schedule . 


Ad  quality  dynamics  under  a two-pulse  media 


The  implications  of  the  quality  dynamics  are  as 
follows.  Ad  quality  rises  during  the  first  media  hiatus  when 
qi  is  below  some  level  q*;  similarly,  it  rises  during  the 
second  hiatus  when  q3  is  below  q . This  'some'  level  is 
given  by  the  asymptote  q*  = 5/ (c+5)  achieved  after  a 
theoretical  hiatus  of  infinite  duration,  i.e. 
g*  = lim  q,(T.)  = — < 1.  The  last  equality  arises  only  when 

Ti— >oo  2 p 

copy  wearout  c = 0.  In  words,  ad  quality  will  rejuvenate  to 
become  as  good  as  new  but  never  quite  like  the  original  in 
the  presence  of  non-negligible  copy  wearout. 

Result  7:  Awareness  dynamics 

Brand  awareness  grows  due  to  advertising  and  declines 
in  its  absence  according  to  the  following  expressions: 
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A(t)  = , g(n)~  {e”a(iI)t  - e_5t}  t e [0,  /,) 

o - a(u) 

= A,  e'8(t“'11  t e (/,,  /,  + x,) 

_ q2gC5)  ^e~a(u)  it-ft+Ti)  _ e-8(t-/i+Ti)  j 

8 - a(u) 


— Set — ij  +ti) 

+A2e 

_ e-8(t-)]+Ti  + J2) 


t G (/j  + Xj,  /j  + Xj  + /2) 

t £ (/,  + X,  + /2/  T] 


Proof:  See  Appendix  M. 


The  awareness  evolution  is  illustrated  in  figure  4-8. 


j t,  weeks 


Figure  4-8 
schedule . 


Awareness  evolution  under  a two-pulse  media 


The  above  awareness  evolution  was  obtained  as  the 
solution  to  the  model  equation  (1)  in  chapter  3 which  makes 
use  of  the  result  7 for  quality  dynamics  q(t) . Now  I will 
evaluate  the  area  under  A(t)  curve,  as  a measure  of  the 


total  revenue. 
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Result  8:  Revenues  generated  by  a two-pulse  schedule 

The  total  area  under  the  curve  A(t),  given  by  the 
result  8,  is  decomposed  as  R = Ri  + R2  + R3  + R4  + Rs/  where 


h g(u) 

A(t)dt  = — 

0 8 - a(u) 


1 


1 e 

a(u)  8^  ^ 8 


«i 


3-a(u)/] 

a(u)  [ 


'iyi  (1  - e-*1) 

R2  = J A(t)dt  = A,  

/,  0 


R,  = 


ll+t.+»2 

I A(t)dt 

1,+x, 

q2g(u) 


8 - a(u) 


. a(u) 


1 e 

5}  + (_ 8 


8/2 


afu)  [ 


+ A, 


(1  - e"8'2) 


Proof.  See  Appendix  N. 

Thus,  the  total  revenue  is  a function  R (Ti,  12, Ti, x2) 
since  awareness  levels  Ai,  A2,  and  A3  depend  implicitly  on 
the  parameters  of  the  two-pulse  schedule  (see  figure  4-8  and 
appendix  N) . In  order  to  obtain  the  optimal  two-pulse 
schedule,  the  revenues  given  by  function  R is  maximized  with 
respect  to  the  burst  durations  2i  and  12,  and  spacings  Ti  and 
t2  subject  to  the  constraints  (i)  h + ti  + I2  + ?2  - T,  and  (ii) 
Ixu  + l2u  = B,  i.e.,  u = B/  ( _Z.  1 +12)  • 

It  is  not  possible  to  solve  this  maximization  problem 
in  closed-form  given  its  rich  formulation.  To  gain  insights, 
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however,  I will  present  some  numerical  analyses  of  this 
optimization  problem. 

A Numerical  Example 

Revenue  as  a function  of  the  duration  of  bursts 

Suppose  that  a firm  can  determine  the  magnitude  of  the 
parameter  values  by  estimation  (say,  by  using  the  method 
shown  in  the  previous  chapter) . Let  these  values,  for 
example,  be  as  follows:  c=0.2,  w=0.01,  d=l,  T=52  weeks,  and 
B=100  media  units.  The  total  revenue  is  given  by  the  result 
8.  For  some  values  of  spacing  (say  xx  = 9.35  weeks  and  t2  = 
25.83  weeks),  the  revenue  function  R(h,l2)  looks  like  this: 


Figure  4-9.  Total  revenue  as  a function  of  the  duration 
of  advertising  bursts. 
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It  immediately  follows  from  figure  4-9  that  there 
exists  a two  pulse  schedule  which  can  do  better  than  an  even 
schedule  (which  will  have  li=26  weeks  and  12=26  weeks)  as 
well  as  a blitz  schedule  (which  will  have  12=0) • So,  it  is 
possible  to  find  the  optimal  two-pulse  media  schedule. 

Result  9:  Optimal  two  pulse  media  schedule 

The  optimal  two-pulse  schedule  is  shown  in  figure  4-10. 
The  optimal  duration  for  the  first  burst  is  l\  = 8.76 
weeks,  the  second  burst  duration  is  1*2  = 8.06  weeks,  the 
optimal  spacing  is  x*i  = 9.35  weeks,  and  the  second  hiatus 
lasts  for  x*2  = 25.83  weeks.  The  optimal  spending  intensity 
is  5.95  media  units/week.  The  maximum  revenue  due  to  this 
schedule  is  R* = 12.16  units.  It  can  be  verified  that  the 
constraints  (i)  li  + Xi  + 12  + x2  = T,  and  (ii)  liu  + l2u  = B are 
met . 

spending 
rate, u (t) 


u 
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* 
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10  20  30  40  50 

Figure  4-10. 


An  optimal  two-pulse  schedule. 
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The  main  contribution  of  discovering  the  optimal  two- 
pulse  schedule  is  that  there  is  a non-monotonic  schedule 
which  is  better  than  both  an  even  schedule  as  well  as 
concentrated  blitz  schedules . In  other  words,  it  is 
profitable  not  only  to  concentrate  spending  (as  shown  in  the 
first  half  of  this  chapter)  but  also  to  wait  and  re-start 
the  advertising  again  for  the  second  time.  The  benefit  of 
waiting  lies  in  the  restoration  of  ad  quality  due  to 
forgetting  effects  during  media  hiatus,  as  can  be  seen  (in 
figure  4-11)  from  the  ad  quality  evolution  associated  with 
this  optimal  schedule. 

Ad  Quality 
q (t) 


Figure  4-11.  Analytical  ad  quality  evolution  for  the 
optimal  two-pulse  schedule. 
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Next,  I will  investigate  the  sensitivity  of  the  optimal 
duration,  spacing,  and  maximum  revenue  to  the  variations  in 
copy  wearout,  repetition  wearout,  and  forgetting  rate.  These 
were  some  unresolved  issues  in  the  literature  (see  Simon 
1982,  Mesak  1992) . 

Result  10:  Effects  of  copy  wearout 

The  following  propositions  are  based  on  numerical 
analysis.  As  copy  wearout  increases, 

(i)  the  optimal  duration  of  bursts  decreases, 

(ii)  the  optimal  inter-burst  spacing  decreases,  and 

(iii)  the  maximum  revenue  decreases. 

This  is  shown  in  figures  4-12,  4-13  and  4-14  below. 


weeks 


Figure  4-12. 
increases . 


Optimal  duration  decreases  as  copy  wearout 
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weeks 


Figure  4-13.  Start  of  the  second  burst  (i.e.  h*  + O is 
earlier  when  copy  wearout  is  greater. 
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Result  11:  Effects  of  repetition  wearout 
As  repetition  wearout  increases, 

(i)  optimal  duration  increases, 

(ii)  optimal  inter-burst  spacing  increases,  and 

(iii)  maximum  revenues  decreases. 

These  propositions  are  illustrated  in  figures  4-15,  4-16, 
and  4-17  respectively.  The  intuition  is  that,  to  counter 
repetition  wearout,  the  spending  intensity  u(t)  is  reduced 
because  of  'spreading  the  media  budget  thinly'  over  a longer 
time  period. 


weeks 


Figure  4-15.  Optimal  duration  increases  with  repetition 
wearout . 
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weeks 


repetition  wearout,  w 

Figure  4-16.  Inter-burst  spacing  and  the  start  time 
increases  with  repetition  wearout. 


Figure  4-17. 


Revenue  decreases  with  repetition  wearout. 
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Figure  4-18.  The  effect  of  forgetting  rate  on  optimal 
duration  of  advertising  bursts. 


Result  12:  Effects  of  forgetting  rate 

For  most  part,  the  duration  of  advertising  bursts  is 
insensitive  to  the  variations  in  the  forgetting  rate. 
However,  for  low  forgetting  rates,  it  is  better  to 
concentrate  media  budget  in  the  first  burst  when  copy  is 
fresh,  while  the  second  burst  is  maintained  for  a shorter 
duration.  This  is  illustrated  in  figure  4-18. 

As  forgetting  rate  increases,  ad  quality  restores  more 
quickly;  consequently,  a shorter  inter-burst  spacing  is 
necessary  to  reach  a certain  ad  quality  level.  Hence, 
optimal  spacing  decreases  with  increase  in  the  forgetting 
rate,  implying  an  early  start  of  the  second  burst.  This  is 
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shown  in  figure  4-19.  The  maximum  revenues  due  to  the  best 
two-pulse  schedule  decreases  as  forgetting  rate  increases, 
as  shown  in  figure  4-20. 


weeks 


Figure  4-19.  The  effect  of  forgetting  rate  on  inter-burst 
spacing  and  the  start  time  of  the  second  burst. 


Summary 

The  theoretical  results  in  this  chapter  provide 
insights  into  the  complex  problem  of  allocating  media  budget 
over  time  in  the  presence  of  ad  wearout. 
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It  was  shown  that  when  ads  are  prone  to  wear  out,  it  is 
profitable  to  concentrate  media  budget  at  the  beginning  of 
the  planning  horizon  rather  than  spreading  it  over  the 
entire  horizon.  Further,  because  ad  quality  enhances  during 
media  hiatus,  it  is  beneficial  to  have  a time  gap  between 
two  bursts  of  advertising. 

It  is  possible  to  find  the  optimal  duration  of  both 
bursts  of  advertising  as  well  as  the  ideal  inter-burst 
spacing  to  achieve  the  maximum  revenue.  The  contribution  of 
discovering  an  optimal  two-pulse  schedule  is  that  there 
exists  a non-monotonic  spending  schedule  which  is  superior 
to  an  even  schedule  as  well  as  concentrated  blitz  schedules. 
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This  non-monotonic  pattern  of  spending  is  consistent  with 
the  observed  advertising  practice  (e.g.  the  use  of  pulsing 
media  schedules),  but  is  contrary  to  several  academic 
studies  (e.g.,  Mahajan  and  Muller  1986,  Hartl  1987,  Sasieni 
1989) . 

Sensitivity  analyses  was  performed  to  gain  insights 
into  several  unresolved  issues  noted  in  the  literature 
(e.g.,  Simon  1982,  Mesak  1992).  It  was  found  that  the 
optimal  duration  and  inter-burst  spacing  decrease  as  copy 
wearout  increases,  and  the  reverse  holds  when  repetition 
wearout  increases.  Hence,  copy  wearout  and  repetition 
wearout  are  conceptually  different  constructs  due  to  the 
opposite  nature  of  the  managerial  response. 

Several  propositions  were  offered  based  on  numerical 
analyses.  The  intuition  underlying  them  is  simple.  Since 
copy  wearout  is  related  to  ads  getting  dated,  it  behooves  an 
advertiser  to  spend  early  on;  hence,  a shorter  duration  and 
spacing  is  optimal  when  copy  wearout  is  larger.  Since 
repetition  wearout  is  related  to  the  audience  getting  bored 
due  to  excessive  frequency  of  exposure,  the  advertiser 
should  lower  the  spending  intensity  by  spreading  the  budget 
thinly  over  the  planning  horizon;  hence,  optimal  duration 
and  spacing  increase  with  enhanced  repetition  wearout. 

While  the  above  propositions  are  qualitative  guidelines 
that  may  enhance  our  understanding,  they  do  not  help  the 
manager  plan  superior  multi-pulse  schedules,  which  is  the 
focus  of  the  next  chapter. 


CHAPTER  5 

OPTIMAL  MULTI-PULSE  SCHEDULE 
Introduction 

In  this  chapter  I will  formulate  the  pulsing  problem, 
i.e.  a general  problem  of  finding  the  best  multi-pulse  media 
schedule.  I will  also  illustrate  a solution  procedure  to 
determine  the  best  pulsing  schedule  from  a finite  number  of 
admissible  policies.  In  doing  so  I will  demonstrate,  using 
real  data,  that  the  best  media  schedule  can  be  different 
from  an  even  spending  policy,  which  until  now  was  found  to 
be  the  best  policy  in  the  academic  literature  (Sasieni  1971, 
1989,  Mahajan  and  Muller  1986,  Little  1975,  1986)  in 
contrast  to  the  universal  preference  for  flighted  schedules 
observed  in  practice  (Little  1975,  Jones  1995) . Thus  the 
model  leads  to  theoretical  results  which  are  more  consistent 
with  practice. 

Problem  Statement 

The  extant  literature  (e.g.  Sasieni  1971,  1989, 
Corkindale  and  Kennedy  1975,  Simon  1982,  Mahajan  and  Muller 
1986,  Mesak  1992)  suggests  that  the  goal  of  the  advertiser 
is  to  find  that  media  schedule  which  maximizes  the  total 
awareness  generated  by  it  over  the  plan  period  as  well  as 
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the  value  of  the  terminal  awareness  level.  Essentially/  this 
criterion  is  equivalent  to  maximizing  total  awareness  over 
the  infinite  horizon  (see  pulsing  literature  in  chapter  2)  .f 
Denote  B as  the  total  gross  rating  points  (GRPs)  that  a 


certain  dollar  media  budget  can  buy  for  the  given  planning 
period  of  T weeks.  Let  It  be  the  variable  which  indicates 
whether  advertising  is  on  (1=1)  or  off  (1=0)  in  a given 
week.  When  advertising  is  on,  it  is  maintained  at  the 

g 

constant  spending  level  u = — / and  zero  when  it  is  off. 

Zlt 


Then,  conceptually,  the  pulsing  problem  is  to  find  the 
optimal  sequence  of  ones  and  zeros  (It)  for  t = 1,  2,  ..., 

T in  order  to  achieve  the  largest  total  awareness. 

Using  the  proposed  awareness  and  quality  dynamics,  the 
formal  pulsing  problem  can  be  stated  mathematically  as 
follows : 


Maximize 

nit) ) 


* - Y(T) 

" t?iYt  + 8(0) 


(1) 


subject  to  the  dynamics  of  awareness  At  and  quality  qt. 


1-Total  awareness  over  infinite  horizon  is 


I Y(t)dt  = I Y(t)dt  + I Y(t)dt  . Since  there  is  no  advertising 

0 0 T 

during  (T,  oo)  , awareness  Y(t)  decays  exponentially  at  the 
rate  6(0).  Hence  the  last  integral  is  given  by 


1 Y(t)dt  = ? Y(D  exp  (-8(0)  (t  - T)  )dt  = 

T T 


Y(T) 

5(0) 
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where  Yt  is  the  error-prone  observed  measure  of  awareness, 
as  given  below. 


(3) 


In  (2)  and  (3)  above,  all  disturbances  are  assumed  as 
gaussian,  zero  mean,  constant  variance  and  uncorrelated. 
That  is, 


/T  1 
0 


~ MVN 


vL 


0 0 

<4  0 

0 


(4) 


The  pulsing  problem  is  specified  by  equations  in  (1)- 

(4)  and  constitutes  a continuous  state-discrete  time, 

stochastic,  nonlinear,  dynamic  optimization  problem  (Sage 

and  Melsa  1971).  In(l),  the  largest  value  of  J is  searched 

over  all  admissible  pulsing  sequences  of  { It)t=i 

The  following  points  be  noted.  First,  there  are  (2T-1) 

sequences  of  { It } which  are  admissible.  This  follows  from 

the  fact  that  in  each  week  the  advertiser  can  choose  either 

to  advertise  or  not  to  advertise;  hence,  there  are  2T 

2 These  assumptions  are  made  for  the  sake  of  simplicity. 
However,  the  covariance  matrix  can  be  full  and  nonstationary 
(time-varying) , if  necessary,  in  some  practical  application 
of  this  approach. 
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spending  patterns  for  T weeks.  However,  a policy  of  not 
advertising  at  all,  that  is  {It  = 0,  for  all  t = 1,  2,  3, 

...,  T},  is  not  admissible  because  the  budget  B has  to  be 
expended  in  T weeks. 

Second,  the  quantity  J,  the  expected  payoff,  is  not  a 
random  variable  since  % = E[Yt|  St]  is  the  posterior  mean  of 
random  variable  Yt|St  ~ N(Yt,  ft)  , where  St  is  the 
information  set  at  time  t. 

Next,  I will  outline  a general  solution  procedure  to 
solve  the  pulsing  problem  implied  by  equations  (l)-(4)  . 

Solution  Procedure 

An  optimal  solution  to  the  pulsing  problem  is  the  zero- 
one  sequence  {It}  which  yields  the  maximum  expected  payoff 
J*  in  (1).  Conceptually,  for  any  arbitrary  sequence  { It > / we 
need  to  optimally  forecast  the  entire  path  of  (the 
conditional  mean)  awareness  {Yt}^  such  that  the  dynamic 

and  stochastic  characteristics  specified  in  (2) -(4)  are 
retained.  Thus,  for  this  arbitrary  sequence  {ItK  the  scalar 

O 

quantity  J({It};  0)  can  be  computed,  assuming  the  knowledge 

of  the  parameters  0 in  (2) -(4). 

A general  solution  to  the  pulsing  problem  is  outlined 
below  and  its  application  is  demonstrated  using  real  data. 


3 In  chapter  3 I showed  how  the  parameters  in  0 can  be 
estimated  using  the  principles  of  maximum  likelihood  and 
optimal  state  estimation  (see  appendix  D) . 
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General 

A dynamic  and  stochastic  model  of  awareness  formation 
is  specified  in  equations  ( 2 ) — ( 4 ) . The  media  budget  B is 
given.  Consider  an  arbitrary  trial  policy,  a zero-one 
sequence  for  T weeks,  and  the  corresponding  pulsing  schedule 
ut=ulf  For  these  pulsing  schedules  {ut}  and  an  indicator 
sequence  for  on/off  status  of  media  spending  {It}/  it  is 
possible  to  uniquely  specify  the  path  of  the  conditional 
mean  of  the  random  variable  Yt  (i.e.  awareness  score  at  time 
t) , given  the  initial  values  Ao  and  qo-  This  is  achieved  by 
the  forward  recursions  of  Kalman  filter  applied  to  the 
dynamics  in  (2)  and  (3) . Besides  the  mean  awareness  path, 
the  variance  of  Ytl&t  is  also  obtained  (i.e.  some  measure  of 
confidence  around  the  mean  awareness  path) . 

The  sequences  of  conditional  means  and  variances  of 
Yt|$t  are  obtained  recursively  as  follows.  In  any  period  t, 
the  dynamics  in  (2)  can  be  expressed  compactly  as 


°ct 


At 

_qt- 


T^t-i  + Ct  + vt 


where  Tt  is  the  transition  matrix,  ct  is  the  drift  vector, 
and  vt  is  the  (bivariate)  gaussian  disturbance.  Then  based 
on  all  information  up  to  t,  $t,  the  optimal  mean  and 
variance  of  the  random  variable  Yt  I &t  is 


Yt  = E[Yt|3t]  = [1  0]E[oct|$t]  = zat 
ft  = V[Yt|$t]  = zV^.jZ'  + oJ 

where  z = [1  0]  is  a matrix  of  constants,  at  is  the 
posterior  mean  of  state  vector  at  and  Vt|t-i  is  its  prior 
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covariance.  See  Harvey  (1994)  or  Jazwinski  (1970)  for 
details . 

The  posterior  mean  and  variance  in  (5)  completely 
characterize  the  distribution  of  Yt|St  because  of  the 
normality  assumption  in  (4).  Hence,  Yt|$t  ~ N(Yt,  ft)  for  all 

t. 

The  quantities  at  and  Vt,t-i/  required  to  compute 
posterior  moments  in  (5),  are  obtained  recursively  from  the 
standard  Kalman  filter  recursions  (see  equation  D.6, 
appendix  D) . However,  there  is  one  subtle  difference:  there 
is  no  observed  awareness  Yt  available  here  and  hence  the 
error  correction,  implied  in  (D.6),  is  zero.  This  is  as  it 
should  be,  since  the  posterior  and  prior  means  are  equal  in 
the  absence  of  exogenous  realization  of  Yt  data  (Harvey 
1994) . 

It  may  be  noted  that  the  optimal  sequences  of  mean 
{Yt >t=i  anc*  variance  {f,.}^  given  in  (5)  are  the  best 

forecast  possible  in  a class  of  any  functions  (see,  e.g., 
Hamilton  1994)  . This  follows  from  the  orthogonal  projection 
theorem  which  assures  the  optimality  for  linear-gaussian 
dynamic  system,  such  as  the  present  (Kalman  1960,  Sage  and 
Melsa  1971,  p.  255).  In  other  words,  "...the  Kalman  filter 
is  the  best  (minimum  error  variance)  linear  filter  for  any 
distribution;  it  is  the  best  filter  of  all  possible  linear 
and  nonlinear  estimators  if  the  plant  and  measurement  noises 
as  well  as  the  initial  state  are  gaussian"  (Sage  and  Melsa 
1971,  p.  254) . 
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Given  the  best  forecast  of  the  entire  path  of  the 
distribution  of  Yt|$t,  for  any  input  sequences  of  {ut}  and 
{It},  it  is  possible  to  compute  the  expected  payoff  in  (1) 
associated  with  every  media  schedule.  The  largest  expected 
payoff,  J*,  can  be  chosen  from  expected  payoffs  associated 
with  all  (2T-1)  schedules  and  a fixed  budget  B. 

The  binary  sequence  {1%}  associated  with  the  largest 
expected  payoff  J*  gives  the  optimal  pulsing  schedule  which 

g 

recommends  spending  at  the  rate  u = ~ when  It=l  (i.e. 

n; 

t=i 

ads  on)  and  not  spend  at  all  when  It=0. 

Optimal  multi-pulse  schedule:  An  application 

Here  I will  apply  the  above  procedure  to  the 
advertising  data  for  a real  cereal  brand  described  earlier 
in  chapter  3. 

Consider  the  pulsing  problem  of  allocating  the  given 
media  budget  to  each  quarter  of  the  year.  In  each  quarter 
the  media  planner  can  choose  to  advertise  or  not  advertise. 
This  will  result  in  24  = 16  pulsing  policies  including  a 
sequence  of  not  spending  at  all,  which  is  not  admissible. 

For  example,  one  such  schedule  on  a quarterly  basis  is: 
spend  during  Jan-Mar.,  do  not  spend  in  April- June,  spend  in 
July-Sept.,  and  do  not  spend  in  the  last  quarter.  This 
implies  the  following  binary  string: 

I={ 1111111111111000000000000011111111111110000000000000} . 

The  constant  spending  rate  in  this  example,  when  advertising 
B 

is  on,  is  u = — GRPs  per  week. 

26 
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Similarly,  all  fifteen  admissible  schedules  can  be 
exhaustively  listed.  The  budget  for  this  illustration  is 
assumed  as  B = 52  times  the  average  weekly  GRPs  over  the 
seventy  five  weeks  of  advertising  by  this  brand  (see  figure 
3-6)  . 

The  expected  payoff,  J,  associated  for  a media  schedule 
is  now  determined  by  evaluating  the  functional  in  (1) , once 
the  weekly  expected  awareness,  Yt  , for  this  schedule  is 
obtained.  This  weekly  expected  awareness  for  the  entire 
planning  period  is  obtained  from  the  optimal  forecast  of  the 
awareness  densities,  N(Yt,  f,.  , for  all  t = 1,  2,  ...,  T - 52 . 

These  densities  are  given  by  (5) . The  optimal  forecast 
constitutes  the  key  step  in  the  entire  procedure,  which  is 
based  on  Kalman  recursions  in  (D.6).  Thus,  the  expected 
payoff  J for  every  admissible  pulsing  schedule  can  be 
evaluated  from  (1) . 

Since  the  admissible  schedules  are  finite  and  few^ 

(e.g.,  fifteen  for  this  illustration),  all  expected  payoffs 
can  be  enumerated  exhaustively,  as  shown  in  table  5-1.  The 
largest  J,  and  the  associated  binary  string,  can  hence  be 
easily  determined.  Thus,  this  procedure  yields  the  optimal 
pulsing  schedule. 

Optimal  Pulsing  Schedule.  The  best  pulsing  schedule  for 
this  cereal  brand  is  one  in  which  the  firm  should  not 

4 if  the  set  of  feasible  schedules  is  large,  then  an 
efficient  search  procedure  will  be  required  to  find  the 
largest  J.  For  example,  branch-and-bound  method  or  genetic 
algorithm  can  be  explored  (see,  e.g.,  Goldberger  1989) . 
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advertise  in  the  first  39  weeks  and  advertise  in  the  last 
quarter  of  the  52-week  planning  year.  This  pulsing  policy 
was  found  to  yield  a maximum  expected  payoff  of  J = 

3265.96.  The  expected  payoff  for  the  even  schedule 
(advertise  in  all  52  weeks)  was  Jeven  = 989.49,  which  is  less 
than  one-third  of  the  best  pulsing  schedule. 

Hence , uniformly  spending  media  budget  is  not 
necessarily  the  best  media  schedule , contrary  to  the  extant 
literature.  In  fact,  for  this  brand,  even  schedule  happens 
to  be  the  worst  of  all  fifteen  schedules,  as  can  be  seen  in 
table  5-1  below. 

The  average  ad  quality  resulting  from  various  pulsing 
schedules  is  given  in  the  last  column  of  the  above  table. 
This  average  ad  quality  can  be  viewed  as  the  ad  response 
coefficient  P in  Nerlove  Arrow  model  (see  equation  5, 
chapter  4) . Since  the  even  schedule  does  not  have  any  period 
of  hiatus  for  ad  quality  to  restore,  it  results  in  a low 
average  ad  quality  over  the  plan  period. 

Although  high  average  ad  quality  (last  column,  table  5- 
1)  is  desirable,  it  is  not  necessary  that  schedules  with 
high  ad  quality  will  also  generate  high  revenues.  This  is 
borne  out  by  comparison  between  schedule  Nos.  8 with  q = 
0.17  relative  to  schedule  Nos.  14  with  q » 0.18.  This  is 
because  the  latter  schedule  spends  money  when  ad  quality  is 
high  during  the  third  quarter,  while  the  schedule  Nos.  4 
spends  money  in  the  first  quarter  (when  ad  quality  is  low 
initially)  and  then  waits  idly  for  quality  to  restore  later 
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during  the  year.  Hence,  it  is  important  to  take  into  account 
the  entire  dynamic  evolution  of  ad  quality,  rather  than  just 
average  summary  statistic,  in  planning  for  the  superior 
media  schedule. 


Table  5-1.  Pulsing  schedules  and  the  expected  payoffs. 


Pulsing  Schedules 

Expected 

Payoff 

Average  Ad 
Quality 

I1 

II 

III 

IV 

J(It) 

q=P 

on2 

on 

on 

on 

989.48 

0.0021 

on 

on 

on 

off 

1003.03 

0.0439 

on 

on 

off 

on 

1701.53 

0.0700 

on 

on 

off 

off 

1028.28 

0.1060 

on 

off 

on 

on 

1811.98 

0.0746 

on 

off 

on 

off 

2053.35 

0.1153 

on 

off 

off 

on 

2165.48 

0.1345 

on 

off 

off 

off 

1083.80 

0.1700 

off 

on 

on 

on 

1787.30 

0.0750 

off 

on 

on 

off 

2172.19 

0.1166 

off 

on 

off 

on 

3051.05 

0.1414 

off 

on 

off 

off 

3018.11 

0.1773 

off 

off 

on 

on 

2291.85 

0.1392 

off 

off 

on 

off 

3218.51 

0.1798 

off 

off 

off 

on 

3265.96 

0.1983 

1.  Roman  numerals  refer  to  the  quarters  of  the  year. 

2.  'On'  represents  a thirteen  week  advertising,  and  'off' 
means  no  advertising  in  that  quarter. 
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The  figure  5-1  below  shows  the  best  pulsing  schedule  as 
well  as  the  mean  awareness  path  {Yt}  for  t=l,  2,  3,  . ..,  T 
= 52  generated  by  it. 


Optimal  Media  Schedule 


weeks 


Figure  5-1.  A pulsing  policy  in  which  not  advertising  for 
first  39  weeks  and  then  spending  all  the  media  budget  in  the 
last  quarter  yields  the  maximum  expected  payoff  for  this 
brand.  The  bold  line  is  the  optimal  forecast  of  the  expected 
awareness  levels  for  this  pulsing  schedule. 


The  corresponding  ad  quality  evolution  is  shown  in 
figure  5-2.  It  can  be  seen  that  ad  quality  decreases  when 
advertising  is  on,  and  restores  when  advertising  is  off.  The 
dotted  line  shows  the  average  ad  quality  during  this  period, 
which  is  equivalent  to  the  constant  ad  response  coefficient, 
P,  in  the  traditional  Nerlove-Arrow  model. 
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Ad  Quality  for  the  Optimal  Schedule 


weeks 


Figure  5-2.  The  quality  of  advertising  restores  during 
the  hiatus  (first  39  weeks)  and  then  wears  out  in  the  last 
quarter . 


Since  the  initial  ad  quality  is  low  (q0  » 0.05)  for 
this  brand  (perhaps  due  to  excessive  advertising  in  the  last 
quarter  of  the  previous  year) , it  behooves  the  firm  to  not 
advertise  early  on  in  this  planning  year,  thereby  allowing 
the  quality  to  restore  before  starting  a heavy  burst.  This 
rationale  is  exploited  in  the  optimal  pulsing  schedule . 

Alternating  pulsing  schedule.  However,  it  is  likely 
that  a firm  may  perceive  the  above  best  pulsing  schedule  as 
risky,  albeit  superior.  This  is  because  of  the  high 
variability  in  the  awareness  pattern  over  time.  As  can  be 
seen  in  figure  5-1,  awareness  gets  as  low  as  10%  before  it 
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reaches  a high  of  over  75%.  Therefore,  a tactical  media 
planner  (usually  works  for  the  ad  agency,  not  the  client) 
may  wish  to  be  sensitive  to  the  client's  risk  preferences. 

In  such  cases,  it  is  possible  to  incorporate  the  risk 
preferences  in  the  objective  functional  in  (1),  and  then  re- 
optimize. ^ Alternatively,  it  is  more  straightforward  to 
adopt  some  other  schedule  which  is  comparable  in 
performance,  say  90%  of  the  best  schedule,  and  meets  other 
objectives,  for  example,  motivate  the  salesforce  or 
incentivize  channel  members.  The  media  planner  can  provide  a 
menu  of  'satisficing'  media  options  to  the  client  from  which 
to  choose  the  one  that  suits  his/her  soft-goals. 

For  example,  in  figure  5-3,  an  alternating  pulsing 
schedule  that  advertises  only  during  quarters  II  and  IV  is 
shown . 

The  awareness  evolution  generated  by  this  pulsing 
schedule  is  less  variable  than  the  one  generated  by  the 
optimal  policy.  This  is  because  the  awareness  level  does  not 
go  below  20%  and  never  shoots  beyond  50%.  (Compare  this  with 
figure  5.1) . The  expected  payoff  from  this  pulsing  schedule 
is  3051.05,  which  is  about  93.4%  of  the  best  schedule  — a 
small  price  for  risk  aversion. 


5 The  variance  of  awareness  can  be  modeled  as  state  or  time 
dependent.  Then,  to  incorporate  riskiness  of  policies, 
modify  the  objective  functional  J in  (1)  by  deducting ^a^ 
quantity  proportional  to  the  variance  of  Ytllt/  e.g*  J — J 

- 0.5  Iff 
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% aware  Alternating  Media  Schedule 


weeks 


Figure  5-3.  The  awareness  evolution  here  is  less  variable 
than  in  figure  5-1. 


This  alternating  schedule  (or  some  similar  one)  can  be 
quite  attractive  if  the  benefits  due  to  soft-goals  are 
subjectively  factored  in.  The  associated  evolution  of  ad 
quality  is  shown  in  figure  5-4  below.  The  average  ad  quality 
generated  by  this  policy  over  the  planning  horizon  is  0.14. 
It  may  be  noted  that  the  alternating  pulsing  schedule  is 
also  significantly  better  than  the  even  spending  policy  for 
this  brand.  This  explains  the  advertisers'  preference  for 
the  flighted  schedule  over  even,  in  this  case. 
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Summary 

Because  ad  quality  declines  when  advertising  is  on  and 
it  restores  during  media  hiatus/  a pulsing  schedule  can  be 
more  profitable  than  spending  media  budget  uniformly.  In 
chapter  3,  I provided  empirical  evidence  to  support  this 
formulation  of  my  model.  Here  I formulated  the  pulsing 
problem,  i.e.  a general  problem  of  finding  the  best  multi- 
pulse media  schedule.  I outlined  a solution  procedure  and 
implemented  it  using  real  advertising  data.  As  can  be  seen 
at  a glance  from  table  2-3,  such  empirical  demonstration  is 
rare  in  the  pulsing  literature. 
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For  this  cereal  brand,  I demonstrated  that  the  best 
pulsing  schedule  is  not  necessarily  an  even  spending  policy, 
contrary  to  the  extant  literature.  I have  shown  that  some 
pulsing  schedule,  with  finite  duration  of  on/off 
advertising,  is  superior  to  even  spending  policy,  regardless 
of  convexity  (Sasieni  1971)  or  asymmetry  (Simon  1982)  or 
competition  (Park  and  Hahn  1991).  I also  characterized  the 
optimal  pulsing  policy  and  the  associated  ad  quality 
evolution.  I compared  an  alternative  schedule  which  is  about 
as  effective  as  the  best  one,  but  not  as  'risky.'  Both  the 
best  and  alternating  pulsing  plans  were  found  to  be  far 
superior  to  the  even  spending  policy. 

Based  on  this  research,  it  can  now  be  concluded  that 
advertisers  who  use  pulsing  schedules  are  not  necessarily 
sub-optimal.  Second,  it  is  my  hope  that,  they  can  now  plan 
their  pulsing  schedules  more  scientifically  (see  Bly  1993) . 
In  appendix  A,  I have  noted  my  impression  that  advertisers 
do  not  know  how  to  use  tracking  data  for  planning  or 
decision-making  purposes,  although  several  advertisers  buy 
it  for  its  presentation  value.  Last  but  not  least,  this 
model  provides  the  'planning  value'  to  the  activity  of 
monitoring  brand  awareness  regularly.  After  all,  it  is 
important  to  track  historical  awareness  only  because  such 
information  can  be  used  in  planning  for  the  future. 


CHAPTER  6 
CONCLUSIONS 

Summary 

In  this  dissertation  I have  addressed  the  issue  of 
allocating  scarce  advertising  resources  over  time.  The 
central  finding  was  that  a given  media  budget  can  be 
expended  in  an  on-off  pattern  more  profitably  than  spending 
it  uniformly  over  time  when  advertisements  are  subject  to 
wear  out.  This  is  because  the  quality  of  an  advertisement, 
or  potential  effectiveness,  declines  when  the  advertising  is 
on  due  to  ad  wearout  effects,  while  it  rejuvenates  during  a 
spending  hiatus  due  to  forgetting  effects.  A continuous 
advertising  strategy  can  drive  ad  quality  to  such  a low 
level  that  it  is  no  longer  beneficial  to  keep  advertising. 
Instead,  a hiatus  in  advertising  is  desirable  to  restore  ad 
quality.  As  the  ad  quality  is  restored  above  some  threshold 
level,  it  becomes  worthwhile  to  commence  a new  burst  of 
advertising.  Hence,  a periodic  on— off  schedule  consisting  of 
several  bursts  of  advertising  is  superior  to  uniform 
spending  in  the  presence  of  advertising  wearout. 

The  proposed  dynamics  of  ad  quality  were  empirically 
tested  using  awareness  tracking  data  for  a real  brand  of 
cereal  in  the  U.K.  market.  The  model  fit  was  quite  good  and 
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superior  to  that  of  several  other  competing  model 
specifications.  The  empirical  study  shows  that  the  ad 
quality  for  this  cereal  brand  declined  when  advertising  was 
on  and  it  rejuvenated  during  periods  of  spending  hiatus. 

Several  new  theoretical  insights  were  gained.  In 
particular,  I showed  that 

• Brand  awareness  in  the  presence  of  advertising 
wearout  declines  even  when  advertising  is  on 

• The  onset  of  wearout  is  sooner  when  copy  and 
repetition  wearout  parameters  are  large 

• Blitz  schedule  can  be  superior  to  an  even  schedule  in 
the  presence  of  advertising  wearout 

• A distinction  between  copy  and  repetition  wearout 
matters  because  they  suggest  different  managerial  actions 

• In  a two-pulse  schedule,  the  second  burst  of 
advertising  should  last  for  a shorter  duration  than  the 
first  one 

• Optimal  duration  of  advertising  decreases  as  copy 
wearout  increases 

• Optimal  spacing  between  advertising  bursts  decreases 
as  copy  wearout  increases 

• Maximum  revenue  decreases  as  copy  wearout  increases 

• Optimal  duration  of  advertising  increases  as 

repetition  wearout  increases 

• Optimal  spacing  between  advertising  bursts  increases 
as  repetition  wearout  increases 
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• Maximum  revenue  decreases  as  repetition  wearout 
increases 

• Optimal  duration  of  advertising  is  insensitive  to 
changes  in  the  forgetting  rate  parameter 

• Optimal  spacing  between  advertising  bursts  decreases 
as  forgetting  rate  increases 

• Maximum  revenue  decreases  as  forgetting  rate 
increases 

In  addition  to  deriving  these  general  insights,  I have 
shown  how  a firm  can  use  the  model  as  a decision  support 
system  for  determining  optimal  advertising  pulsing  schedule. 
Since  some  pulsing  schedules  are  better  than  others,  a 
decision  support  system  is  required  to  determine  the  best 
one.  The  application  of  the  proposed  decision  support  system 
was  demonstrated  using  the  available  cereal  brand  awareness 
tracking  data. 

To  summarize,  based  on  findings  in  behavioral  research, 
I have  developed  a model  of  advertising  dynamics  that 
incorporates  ad  wearout  and  which  can  assist  an  advertiser 
to  plan  optimal  pulsing  schedules. 

Future  Research 

In  this  dissertation  I have  investigated  the  optimal 
advertising  strategy  in  the  presence  of  advertising  wearout. 
I see  three  key  issues  for  future  research  in  this  area. 
First,  the  determination  of  optimal  campaign  replacement 
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strategy.  There  are  interesting  trade-offs  between 
advertising  scheduling  and  copy  creation  and  replacement 
decisions  which  need  to  be  studied.  More  specifically/  as  ad 
campaigns  wear  out  there  is  a need  to  find  new  creative  ways 
of  communicating  product  benefits  and  positioning  (Goodrum 
and  Dalrymple  1994  p.52-54.) . This  leads  to  the  question  of 
when  should  ad  campaigns  be  replaced?  Pekelman  and  Sethi 
(1978)  raise  this  issue  and  also  provide  an  elegant 
solution.  However,  they  have  ignored  the  empirical  fact  that 
firms  employ  non-monotonic  pulsing  schedules  which  result  in 
rejuvenating  ad  quality  during  spending  hiatus;  this  extends 
the  life  of  the  campaign,  and  hence  delays  its  replacement. 
So,  the  timing  of  copy  replacement  in  the  presence  of 
wearout  and  pulsing  schedules  is  an  open  question. 

Second,  optimal  advertising  scheduling  in  the  presence 
of  competition  is  an  under-researched  area.  In  particular, 
existing  research,  for  example,  Villas-Boas  (1993)  and  Hahn 
and  Park  (1993),  have  ignored  advertising  wearout  phenomenon 
and  its  effect  on  optimal  advertising  strategies. 

Third,  optimal  advertising  scheduling  considering 
seasonality  as  well  as  wearout  is  another  interesting  and 
important  topic  for  future  research.  Currently  there  is  no 
paper  on  this  subject  in  the  marketing  science  literature. 


GLOSSARY 


Ad  Quality 

A parameter  for  the  quality  of 
advertisement  in  sales  response  models 
(see  Parsons  and  Schultz  1976,  p.85).  A 
time  varying  coefficient  for  media 
spending  effects,  denoted  as  Pt  or  q(t) . 

Ad  Response 
Function 

A function  that  captures  the  notion  of 
diminishing  returns  (concave)  or 
increasing  returns  (convex)  to  media 
spending  effort  (see  Simon  and  Arndt 
1980) . 

Ad  Wearout 

The  deterioration  in  the  quality  of 
advertisement  (see  Greenberg  and  Suttoni 
1973)  either  due  to  excessive  frequency 
(repetition  wearout)  or  due  to  passage  of 
time  (copy  wearout) . 

Advertising 

Impact 

A notion  of  quality-adjusted  media 
spending  effect  (see  Parsons  and  Schultz 
1976,  p.85;  Arnold  et  al.  1987).  It  is  a 
product  of  quality  of  advertisement  and 

the  amount  of  media  spending,  i.e.  (3t  ut  . 

Awareness 

An  effect  of  advertising  communication.  An 
intermediate  market  response  variable  of 
interest  (see  Hanssens,  Parsons  and 
Schultz  1993) . The  number  of  consumers  who 
recall  the  name  of  the  advertised  brand 
first  (see  Brown  1985) . 

Blitz 

A schedule  of  spending  media  budget  in 
which  a burst  of  advertising  is 
concentrated  over  a time  period  shorter 
than  the  planning  horizon  (see  Mahajan  and 
Muller  1986) . 

Burst 

A practitioners'  term  for  a blitz  spending 
pattern  (see  Bly  1993) . 
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Control 

Variable 


Copy  Wearout 


Decay 


Decline 


Even  Schedule 


Exogenous 

Variable 

Flighted 

Schedule 

Forgetting 

Rate 

Intensity 
Media  Budget 

Planning 

Horizon 

Potential 

Effectiveness 


A variable  under  a decision-maker's 
control.  A variable  whose  optimal  value  or 
time  path  is  desired  (see  Kamien  and 
Schwartz  1990)  . 

A deterioration  in  ad  quality  over  time 
for  reasons  other  than  repetition  (see 
Pekelman  and  Sethi  1978) . Denoted  by  a 
parameter  c which  captures  the  magnitude 
of  copy  wearout. 

A drop  in  aggregate  awareness  levels  due 
to  the  phenomenon  of  forgetting  (see 
Zielske  1959) . 

A verb  used  to  refer  to  the  deterioration 
in  ad  quality.  It  is  distinct  from  'decay' 
of  aggregate  awareness. 

A schedule  of  spending  media  budget 
uniformly  over  the  entire  planning  horizon 
(see  Mahajan  and  Muller  1986) . 

An  input  or  independent  variable  that  is 
external  to  the  model.  A variable  not 
under  the  decision-maker's  control. 

A practitioners'  term  for  spending  media 
budget  in  a few  bursts  in  alternation  with 
periods  of  no  spending  (see  Bly  1993) . 

An  exponential  rate  of  decay  in  awareness 
levels,  denoted  as  8. 

The  level  of  media  spending  rate  at  any 
time  t. 

A dollar  amount  available  to  buy  media 
space  or  time  to  advertise.  A measure  for 
the  effort  of  advertising,  denoted  as  B. 

A span  of  time  for  which  a media  schedule 
is  designed,  denoted  as  T (usually  52 
weeks) . 

An  intuitively  meaningful  term  for  ad 
quality  construct;  a time-varying 
coefficient  for  advertising  effects  in 
response  models. 
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Pulse 

Quality- 

Adjustment 

Factor 

Repetition 

Wearout 


Schedule 


Spacing 


Spending  Rate 


Switching  media  advertising  on  or  off. 

A notion  that  captures  the  time  varying 
effectiveness  of  an  advertisement  copy 
(see  Parsons  and  Schultz  1976,  p.85, 
Little  1975) . Also  called  as  ad  quality. 

A deterioration  in  ad  quality  due  to 
repetition  of  advertisements  (see  Ray  and 
Sawyer  1971;  Blair  1987) . The  parameter  w 
captures  the  magnitude  of  repetition 
wearout . 

A specification  of  how  to  spend  media 
budget  over  time  for  a given  planning 
horizon.  A pattern  of  spending  media 
budget  over  time. 

A time  gap  between  two  bursts  of 
advertising;  the  duration  of  media  hiatus 
(see  Calder  and  Sternthal  1980)  . 

The  amount  of  media  spending  in  dollars 
per  week  (see  Little  1975) . 


APPENDIX  A 

IMPRESSIONS  BASED  ON  AN  INFORMAL  SURVEY 


I contacted  several  leading  advertising  agencies, 
advertisers,  and  marketing  consulting  firms  in  the  U.S. 

These  include  the  following:  DDB  Needham  (Helen  Katz) , Leo 
Burnett  (Sheryl  Harkins),  Johnson  and  Johnson  (Jill  Lesko) , 
Copernicus  (Steve  Tipps) , and  Prof.  Lancaster  (UF  Faculty  in 
Advertising) . Based  on  my  discussions  with  these  people,  the 
following  salient  points  can  be  noted. 

1.  Flighted  media  schedule  is  the  most  prevalent 
pattern  of  spending  advertising  budget  for  the 
established  brands,  while  up-front  loading  (blitz) 
is  usually  used  for  new  products  or  re-staging  of 
old  products. 

2.  Most  often,  media  plans  are  based  on  "bogus 
intuition"  or  arbitrary  rules  of  thumb,  such  as 
"frequency  of  3+." 

3.  Most  advertisers  do  not  know  about  the  rate  at 
which  their  advertisements  are  wearing  out,  nor  do 
they  know  how  to  determine  it  empirically. 

4.  Most  respondents  believe  that  a quantity  like  'the 
rate  at  which  advertisements  wear  out'  is  relevant 
for  ad  copy  replacement  decisions,  these  decisions 
are  presently  made  on  "ad  hoc  basis"  due  to  lack  of 
formal  guidelines. 

5.  Besides  ad  wearout,  scheduling,  and  copy 
replacement,  advertisers  are  interested  in  knowing 
about  whether  existing  advertisements  are 
effective,  how  well  a new  commercial  is  performing, 
and  how  much  did  they  gain  by  changing  commercials. 
At  present,  these  assessments  are  based  on 
guesswork. 
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6.  Several  advertisers  buy  awareness  tracking  data 
from  Millward  Brown,  Inc.  but  do  not  know  how  to 
make  use  of  it  for  planning  or  decision  making 
purposes  (e.g.  scheduling,  copy  replacement) . It 
appears  that  these  awareness  plots  have  some  use  in 
presentations  on  brand  performance. 

7.  My  overall  impression  is  that  the  state-of-the-art 
in  media  scheduling  is  quite  ad  hoc  and 
atheoretical . 


APPENDIX  B 
NOTATIONS 


dA 

rate  of  change  in  awareness. 

dt 

dq 

dt 

rate  of  change  in  ad  quality. 

u 

fixed  level  of  spending  rate. 

5 

forgetting  rate  parameter. 

P 

ad  response  coefficient. 

<35 

0 

vector  of  parameters  to  be  estimated. 

a,  y,  X,  a, 
etc. 

parameters  whose  meaning  is  local  to  the 
context . 

AAt 

change  in  awareness  in  unit  time  interval . 
information  set  at  time  t;  includes  current 

and  previous  values  of  dependent, 
independent,  and  control  variables  of  a 

dynamic  system;  St-i  cz  St- 

St,  T]t,  Vt,  etc. 

error  terms,  assumed  as  normally 
distributed. 

A ( 0 ) , Ao 

initial  awareness  at  time  t=0. 

A ( t ) 

awareness  at  time  t. 

a (u) 

ad  wearout  rate,  a(u)=  c + w u. 

B 

media  budget  (in  $ or  GRPs) . 

c 

copy  wearout  parameter. 

Ct 

drift  vector  in  state  space  models. 

g(u) 

response  to  advertising  effort  u(t). 

I (u) 

indicator  variable;  1 when  media  is  on,  0 
when  media  is  off. 

J 

value  of  the  objective  functional. 
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k 

1 

q(0),  q0 
q ( t) 

R 

t 

T 

Tt 

u(t) 


w 

z 


retention  rate  or  carry-over  factor, 
duration  of  a blitz  schedule, 
initial  ad  quality  at  time  t=0. 
ad  quality  at  time  t. 

total  awareness  for  some  spending  pattern, 
time  (in  weeks) . 

planning  horizon  (usually  52  weeks) . 
transition  matrix  in  state  space  models, 
spending  rate  (in  $ or  GRPs/week) . 
repetition  wearout  parameter, 
constant  vector  in  state  space  models. 


APPENDIX  C 

BOOTSTRAPPING  INFERENCE  IN  SMALL  SAMPLES 


The  basic  idea  of  bootstrapping  is  as  follows.  For  a 
finite  sample  consisting  of  N observations  {Xi,  X2/  X3, 

...xN},  the  maximum  likelihood  nonparametric  estimator  of 
the  population  distribution  is  the  (discrete)  probability 
function  that  places  probability  mass  1/N  on  each  of  the 
observations  Xi  (Efron  and  Tibshirani  1986) . The  startling 
implication  of  this  statement  is  that  when  the  sample 
contains  all  the  available  information  about  the  population, 
it  is  possible  to  proceed  as  if  the  sample  is  the 
population,  and,  hence,  one  can  sample  with  replacement  from 
the  sample  itself  to  infer  about  the  population 
characteristics  (Efron  1979,  Noreen  1989) . 

In  applying  the  bootstrapping  principle,  I compute  the 
prediction  error  vector  e = A(x)-f(x,0),  where  0 is  the 
maximum  likelihood  estimate  of  parameters  based  on  the  real 
sample.  I sample  with  replacement  from  the  vector  of 
prediction  error  e = {ei,  e2,  e3/  ...eia)T  such  that  each  ei 
has  a selection  probability  of  1/12.  Denote  one  such 
sampling  realization  as  ei*.  Knowing  the  estimated  parameter 
vector  0,  the  independent  variable  matrix  x,  and  the  form 
f (.,.),  we  can  construct  Ai*  = f (x,  0)+  ei*.  Based  on  the  new 
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Ai*,  known  form  f and  exogenous  x,  we  proceed  with 

estimation  as  if  this  is  just  another  probabilistically 
equivalent  data  set.  Denote  the  new  parameter  estimates  as 

A A A 

9x.  This  process  was  repeated  1000  times  to  get  02/  03/  04/ 
0iooo.  Such  a collection  of  0 comprise  the  bootstrap 
distribution  of  parameters. 

The  code  for  obtaining  bootstrap  distribution  of 
parameters  was  written  in  Gauss  programming  language.  The 
empirical  histograms  for  copy  wearout  parameter  (figure  3- 
2),  repetition  wearout  parameter  (figure  3-3),  and 
forgetting  rate  parameter  (figure  3-4)  are  shown  in  the  main 

text . 

In  this  approach  the  95%  confidence  interval  for  any 
parameter  is  constructed  by  noting  the  25th  smallest  and  the 

A 

25th  largest  value  of  0 collections. 


APPENDIX  D 

OPTIMAL  STATE  ESTIMATION 


The  parameters  of  dynamic  models  are  estimated  by 
maximizing  the  joint  likelihood  of  observing  the  sequence  of 
data  {Yi,  Y2,  . ..,  YT}  . Formally,  the  joint  density  is  given 

by 

Pr  (Yj,  Y2,  . . . , Yt;  0) 

= Pr  (Yt|  {Ylf  Y2,  . . . , YT_X} ) x Pr  ( {Y:,  Y2,  . . . , YT_X}  ) 

= Pr  (Yt|  IT-1)  Pr  (Yj.jl  IT_2)  Pr  ( {Y:,  Y2,  . . . , YT_2} ) (D.l) 

= Pr  (Yt|  Im)  Pr  (Yt_J  It_2) Pr  (YJ  I0) 

= ft  Pr  (Yt|  1^) 

t = l 

State  Space  Form.  Consider  the  following  two 
equations . 

Yt  = zat  + et,  et  ~ N(0,  H)  (D.2) 

at  = T;  a,..!  + ct  + vt  vt  ~ N(0,  Q)  (D.3) 

where  Yt  is  the  observed  awareness  scores,  z is  a vector  of 
constants,  at  is  the  dynamic  process  of  interest  (e.g. 
awareness  evolution) , Tt  is  the  transition  matrix  and  Ct  is 
the  drift  vector  (both  together  constitute  sufficient 
information  to  specify  the  dynamics  of  any  model),  {st,vt} 
are  gaussian  error  terms. 

The  equation  (D.2)  is  called  as  the  observation 
equation,  and  (D.3)  is  referred  to  as  the  transition 
equation  (see  Harvey  1994) . All  previous  models  can  be 
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expressed  compactly  in  the  above  two  equations;  see  table  3 

2. 

For  this  state  space  form,  the  log-likelihood  function 
of  parameters,  given  the  observed  data  {Yi,  Y2,  ...»  Yt}  is 

given  by  (see  Harvey  1994,  p.125-127), 


LL(0)  = Log  (PrfY,,  Y2,  . . . , YT|  0} ) 

= Log  (IT  Pr{Yt|  I^}) 

t ^ (D . 4 ) 

= I Log  (Pr{Yt|  I,.,}) 

t = l 

= - ~ Log  (2  7C)  - ^ 2^  Log  (det  (Ft)  ) - - 2^  e;Ft_1et 


where  et  = Yt  - Ytlt_,  and  det(Ft)  denotes  the  determinant  of 
covariance  matrix  Ft . 

The  quantity  Yt|t_1  and  Ft  are  required  to  compute  the 
likelihood  function  in  (D.4).  These  are  given  by  the 
following  expression  in  (D.5), 

^tit-i  = zat-i 

Ft  = z Vut-i  z'  + H 

which,  in  turn,  depends  on  quantities  at-i  and  Vt|t-i*  The 
latter  quantities  are  obtained  recursively  from  the 
following  expressions  in  (D.6). 

atit-i  = at-i  ct 

= W-iV  + Q 
at  = atit-i  + Kt(Yt  _ z at|t_1) 

- KtzVtlt_1 

Kt  = Vt|t_1z'(zVt|t_1z'  + Hr1 


(D.6) 
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The  optimality  of  recursions  in  (D.6)  were  shown  by 
Kalman  (1960)  for  discrete  time  and  Kalman  and  Bucy  (1961) 
for  continuous  time  state  space  models. 

For  any  trial  value  of  0,  prediction  error  et  and 
covariance  Ft  are  available  recursively  based  on  the 
recursions  in  (D.6) . This  crucial  step,  based  on  filtering 
theory  (see,  e.g.,  Jazwinski  1970),  retains  the  dynamic 
characteristics  inherent  in  the  data,  unlike  other 
approaches  (e.g.  ordinary  least  square).  The  optimal  0*  is 
obtained  by  maximizing  the  likelihood  function  LL(0)  in 
(D.4)  using  standard  numerical  procedures  (see,  e.g.,  Gill 
et  al . 1981) . 


APPENDIX  E 

AWARENESS  EVOLUTION  IN  THE  PRESENCE  OF  WEAROUT 


Consider  a blitz  schedule  given  by  (1) . Note  that 

spending  rate  u=B /I  is  constant,  i.e.  independent  of  time, 

when  advertising  is  on  during  the  interval  [0,1) . Hence,  the 

response  function  g(u),  forgetting  rate  8,  and  ad  wearout 

rate  a(u)  do  not  depend  on  time  t.  Over  this  interval 

dq 

[0,1),  ad  quality  evolves  according  to  - -a(u)q  , or 

equivalently  q(t)  = e~a(C)t  , assuming  q (0)  is  initialized  at 
unity. 

The  evolution  of  awareness  A(t)  is  specified  by  the 

dA 

ordinary  differential  equation  (o.d.e.)  : — = q(t)  g(u)  - 5A. 
Substituting  q(t)  = e~a<iI)t  , we  get  awareness  evolution  over 
the  interval  [0,1)  as  follows: 

— = e_a®  t g(u)  - 6A(t)  (E  • 1 ) 

dt 

Note  that  (E.l)  is  linear  in  A(t),  has  constant  coefficient 
5,  and  a time-varying  forcing  function  e"a(li)tg(u)  . Hence,  it 
can  be  solved  by  the  method  of  integrating  factors  (I.F.) . 
The  I.F.  is  exp(j5dt)  = exp(8t),  since  8 is  constant.  On 
multiplying  throughout  by  the  I.F. 

e8t  = 5t  e-at  g(u)  - 8 e5t  A(t)  , 
dt 
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writing  the  above  o.d.e.  as  an  exact  differential, 

— (e^t  A(t) ) = g(u)  e®  “ 
dt 

and  integrating  the  above  result,  we  get, 

e5t  A(t)  = \ g(u)  e(5(u)  “ ^ dt  (E.2) 

e(8  - a(u)  )t 

= g(u) — rr — + const. 

8 - a(u) 

where  the  constant  of  integration  depends  on  the  initial 
awareness  level  A(0),  which  can  be  determined  on 
substituting  t=0  in  (E.2).  Thus,  awareness  dynamics  over  the 
period  [0,1)  is  given  below: 

A (t)  = g(n)  - (e-St  _ e-a(u)t)  + a(0)  e-St  (E.3) 

1 8 - a(u) 

Now  consider  the  interval  (1,T].  On  this  interval  for 
the  blitz  schedule  in  (1),  ad  quality  evolution  doesn  t 
matter  since  we  have  g(0)=0,  by  assumption  (see  Mahajan  and 
Muller  1986) . Hence,  the  decay  of  awareness  in  the  absence 
of  advertising  is  obtained  by  integrating  the  o.d.e.  dA/dt  — 

-5  A (t) . Hence,  we  get, 

A2(t)  = k e_St 

where  k is  arbitrary  constant  determined  by  the  continuity 
condition  A Al)  = A 2(1).  Therefore,  k = A,(l)  edl  . Thus,  we 

get,  awareness  evolution  over  the  period  (1,T],  as 

— 8(t  - 1) 


A2(t)  = Aj  (1)  e 


(E . 4 ) 
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Together,  (E.3)  and  (E.4)  determine  the  awareness 
evolution  due  to  a blitz  schedule  of  duration  1,  in  the 
presence  of  ad  wearout. 


APPENDIX  F 

AWARENESS  PEAKS  AT  t* 


This  result  is  obtained  by  finding  the  location  of  the 
maximum  awareness  level  of  equation  (A2.1) . On 
differentiating  (A2.1)  with  respect  to  argument  t,  we  get, 

= _J!®_  (e-a(u)t  (_a(u)  ) - e-St  (-8) ) (F.l) 

dt  8 - a(u) 


Setting  dAi/dt  = 0 and  solving  the  resulting  first 
order  condition,  yields  the  following  unique  solution  for 
the  stationary  point, 


, In  8 - In  a(u) 
t = 


(F.2 


8 - a(u) 

Since  t*  is  the  only  root  of  the  first  order  condition 
dA/dt  = 0,  the  location  of  the  peak  is  unique  and  global. 

That  Ai(t)  in  (2.1)  has  a peak  at  t*  is  verified  from 
the  curvature  characteristics.  On  differentiating  (F.2) 
w.r.t.  t,  we  get, 

& A(t)  g(u) 

— ie  - e --  v-w, 

(F.3) 


at2 


8 - a(u) 
g(u)  e“a(C,t 


(e"au)t  (-a(u)2  - e“^t  (-S)2 
( (a(u)2  - e',8~a(5),t  (6)2) 


8 - afu) 

Substituting  t*  from  (F.2)  in  (F.3),  and  on  simplifying  the 
algebra,  we  get, 
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ff  A 
dt2 


g^-- ( (a(u)  f - 52) 

8 - a(u) 

g(u)  e’a(n'a)t 


5 - a(u) 
g(u)  e-3®1 


( (a(u)  ) - 


a(u) 


5 ) 


5 - a(u) 

-g(u)  e“a(n'U|t  afu) 


(a(u)  - 8)a(u) 


< 0 


since  g(.)  > 0,  the  exponential  function  is  always  positive, 
and  wearout  rate  a(.)  > 0.  Hence,  Ai(t)  in  (2.1)  has  a 


unique  peak  at  t* . 


APPENDIX  G 

MONOTONICITY  OF  AWARENESS  EVOLUTION  IN  THE  ABSENCE  OF 

WEAROUT 


The  absence  of  ad  wearout  implies  a(u)  is  zero.  In 
this  case,  cSMtJ/dt  is  strictly  positive,  as  can  be  seen 
below: 

3Aj_  _ {e-a(u)t  (_a(u)  ) - e~5t  (-5)  ) 

dt  8 - a(u) 

— = ^ (-  e"8t  (-6)  ) 

St  8 

= g(u)  e“^t 
> 0 

Hence,  the  monotonicity  of  (2.1)  in  absence  of  ad 
wearout . 
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APPENDIX  H 

ON  THE  ONSET  OF  WEAROUT 


We  wish  to  understand  how  t*  behaves  as  ad  wearout 
increases.  From  F.2  in  Appendix  F,  t*  is  a known  function  of 
wearout  a(u).  Also  a(u)  = c + w u,  hence  da/dc  = 1/  and 
da/ dw  = u . 

Copy  Wearout.  Consider  copy  wearout  parameter  c. 


at*  _ at*  aa(u)  _ at* 
ac  aa(u)  dc  3a(u) 

1 6 

(5  - a(u)  ) (0  - — ) - Ln  — (0  - 1) 
a(u) a(u) 

= (5  - a(u)  )2 

5 - a(u)  S(u) 

+ Ln 


a(u) 


a(u) 


(8  - a(u)  )2 

at*  . _ 8(u)  8(u)  . „ 

Sign(-r— ) = Sign(Ln  • - ~~zr  + 1)  • 
y 3c  a(u)  a(u) 

The  term  on  the  right-hand  side  parenthesis  is  of  the 
type  G (x)  = Ln(x)-  x + 1.  It  is  easy  to  verify  that  Ln(x) 
will  always  lie  below  (x-1)  for  all  x > 1,  because  G' (x)  = 
1/x  - l is  negative  over  x e (1,°°)  . Hence,  Sign  (at  /dc)  is 
negative  for  8 > a(u)  . 

Similarly,  for  8 < a(u),  t = Ln  (a  ( u ) /8)  / (a  ( u ) - 8), 
from  (F.2)  in  appendix  F.  A similar  analysis  as  above  will 
lead  to  the  conclusion  that  Sign(at*/ac)  is  negative. 
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Consequently,  regardless  of  the  relative  magnitudes  of 
8 and  a(u),  Sign(dtVdc)  is  negative.  Therefore,  the  onset 
of  decline  in  awareness  is  delayed  as  copy  wearout 
decreases,  as  required. 

Repetition  Wearout.  Consider  0t*/dw.  Here,  dt*/dw  = 
(dt*/da)  (da/dw)  = (dt7da)u.  Since  u is  the  spending  rate 
when  advertising  is  on,  so  u > 0.  Note  that  dt  /da  = dt  /dc, 
in  the  above  analysis.  Hence,  Sign(dtVdw)  is  also  always 
negative.  That  is,  t*  decreases  as  w increases. 

Hence,  the  onset  of  decline  in  awareness  is  delayed  as 
repetition  wearout  decreases,  as  required. 


APPENDIX  I 

REVENUES  FOR  A BLITZ  SCHEDULE 


For  a blitz  schedule  in  (1),  the  awareness  evolution  is 
given  by  (2. 1-2. 2)  in  chapter  4.  Revenues  are  obtained  by 
integrating  the  area  under  the  awareness  curve  over  time 
[0,oo],  as  in  Mahajan  and  Muller  (1986)  for  example.  Hence, 
revenues  are  given  by  R(l)  below: 


RID  = 1 A1(t)dt  + I A2  (t)dt 


g(u)  J 

8 - afu)  o 


i 
-at 


(e~at  - e”6t  )dt  + J A0e“8tdt  + J A,CDe 


I A, 


g(u) 


-at 


8 - afu)  -a 
g(u)  1 - 


1 e”st 

J e"8t 

) + A0( 

0 

-8 

0 

-S(t-l) 


) + A,  (1)- 


-a  1 


1 - e 


-Si 


8 - a(u) 


")  + A0 


1 - e 


-hi 


8 


+ AX(D  - 


g(u)  1 - e 


-ai 


1 - e~8J  1 - e 

; ) + A0  — 


8 - a(u) 

+ 0 


-8J 


g(u)  i_-al  .-hi,  . , .-hi, 


(e  - e-6J)  + A0e  ) - 
8 - a(u)  o 


g(u)  1 - e 


-ai 


8 - a(u) 

1 - e 


1 - e”8J  e 
r + 


-ai 


-5i 


+ A0(- 


e 

+ -=-> 


g(u) 


8 - afu) 


1111  _aJ 
(~  - r)  - - T)e 

. a 8 a 8 


g(u)  [ 8 - afu) 


8 - afu)  V 8a(u)  / 


(1  - e~aJ)  + y 
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APPENDIX  J 

SUPERIORITY  OF  AN  EVEN  SCHEDULE  WHEN  NO  AD  WEAROUT 


In  absence  of  ad  wearout,  the  revenues  are  obtained  by 
taking  limits  of  R(I)  as  wearout  rate  approaches  zero. 
(Alternatively/  the  differential  equation  in  E.l  can  be 
solved  for  a=0,  and  the  area  under  its  awareness  evolution 
can  be  obtained  by  integration  as  in  appendix  I . ) . Revenues 
as  a function  of  blitz  duration  1 is  given  by  equation  (5) 
in  chapter  4.  Taking  limits  by  using  L'Hopital's  rule, 


R(i;  no  wearout)  = lim  R(l) 

a— >0 


. ,(u)  0 - e-Ji 

= lim  — — ( : ) + U 

a->0  O 1 

1 . u®  -ai  1 

= lim  — r—  e 1 

a — >0  O 

g(u)  , 


To  maximize  revenues/  the  first  order  condition  for  the 
above  R(l)  is 

dR  1 dq  8u  g(u) 

dl  5 du  dl  5 


B 


g(u) 


s 0 


= - g'(u)  (-  y)  + 

B 

=>  g'(u)  (-)  = g(u) 

^ g® 

=>  g'(u)  = -zr- 
u 

In  the  above  analysis/  I have  used  the  chain  rule  of 

du_  _ _B_ 

differentiation  and  the  fact  that  u = B/l  and  ^ ^ * 
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In  the  first  order  condition  above,  the  last  equality 
states  that  the  slope  of  g(.)  at  x*  must  equal  the  value  of 
the  function  g(x*)  divided  by  the  distance  x . Hence,  the 
maximum  is  located  at  the  point  where  a tangent  through  the 
origin  touches  the  curve  g(x).  Graphically,  this  condition 

is  shown  below. 


slope  of  tangent 
g (x)  line  = g'  (x*) 


So  when  g(.)  is  convex,  as  shown  above,  then  chattering 
between  0 and  x will  yield  the  maximum  R . 

However,  for  concave  g ( . ) , a.  tangent  through  the  origin 
which  touches  g(x)  is  at  the  origin  itself.  The  dotted  line 
below  shows  that  some  arbitrary  line  can  be  the  tangent  only 
close  to  the  origin  point,  and  nowhere  else. 

A tangent  through 
the  origin 

/ g(x) 


X 
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Therefore,  the  maximum  is  at  x*  -»  0.  But  x is  the 
spending  rate  B/l.  Hence,  the  smallest  x implies  the  largest 
1,  which  is  i*  = T on  a finite  planing  horizon.  Hence,  an 
even  spending  schedule  is  the  optimal  strategy  when  there  is 
diminishing  returns  to  advertising  effort  and  no  ad  wearout. 


APPENDIX  K 
SUPERIORITY  OF  BLITZ 


From  equation  (5)  in  chapter  4,  the  revenues  for  a 
blitz  schedule  in  the  presence  of  ad  wearout  and  concave 
response  to  advertising  effort  is  given  by 


where  ad 
rate  u = 


RU)  = 


g(u) 

5a(u) 


(1  _ e-a(S)J)  + 


Ar 


wearout  rate  a (u)  = c + w u,  the  spending 
— , and  g (u)  is  any  concave  function  with  the 


properties  g(0)=0,  dg/du.  > 0,  d2g/du  < 0. 

To  demonstrate  that  there  exists  a best  blitz  schedule. 


I have  to  show  that  there  is  some  revenue  R(l*)  which  is  the 
largest  for  some  blitz  schedule  with  length  1 . That  is, 
equivalently,  R [1)  has  an  inverted- U shape,  as  shown  in 


figure  below. 


RU) 
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Now  let  Ao=0  and  6=1,  without  loss  of  generality  since 
both  Ao  and  6 are  constants  that  do  not  depend  on  i . On  re- 
arranging R(l)  in  (K.l), 

g(u)  M -<c+w\i 

ru)  = % a - e J ) 

c + w — 


Ig(S)  a-e-e^l 


cl  + Bw 
lg(u) 


(K.2) 


n a - ae  ) 
cl  + P 

where  a,  P are  some  positive  constants. 

From  (K.2)  it  can  be  verified  that  R(0)  = 0.  This 
follows  from  taking  limits,  as  shown  below. 


R(0)  = lim  — g(u)  a - ae"cj) 

j->o  cl  + P 

= — g(oo)  (1  - a)  ( K . 3 ) 

0 + p 

= 0 

Since  g (u)  is  bounded  from  above,  g(Q°)  is  a positive 

constant;  hence  R(0)=0. 

Similarly,  R(oo)=0  from  (K.2),  since 


R(qo)  = lim 


J->00  cl  + P 


g(u)  (1  - ae  cj) 


= lim 

J->co 


g(0)  (1  - ae  °J) 


(K . 4 ) 


c 

= 0. 


g(0)  a - 0) 


The  last  equality  follows  from  the  fact  that  g(0)  0. 
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Finally,  R (1)  is  increasing  in  1 at  the  origin.  This 
can  be  verified  by  differentiation,  but  more  simply,  this 
follows  by  noting  that  R(I)  at  1=1  is  greater  than  R(0) . 

1 -c, 

lim  r (I)  = g(B)  (1  - ae  ) 

j->i  c + (3 

> 0 = R(0)  (K.5) 

R(i)  > R(0) 

Together  from  (K.3-K.5),  R(l)  is  initially  increasing 
and  then  tending  to  zero  at  infinity.  Hence,  R(l)  has  an 
inverted-U  shape,  as  required.  The  numerical  example  in 
chapter  4 illustrates  this  fact  in  concrete  terms. 


APPENDIX  L 

DERIVATION  OF  AD  QUALITY  DYNAMICS 


Ad  quality  evolution  is  given  by  the  following 
equation: 

^ = -a(u)q  + (1  - I(u)  )5(1  - q)  . (L-D 

dt 

(A)  Consider  the  first  burst,  i.e.  0<  t < h.  Here 

I(u)=l  since  ads  are  on.  Hence,  = -a(u)q  ; its  solution 

is  given  by 

q(t)  = e'a(irit  , during  te  [0,h)  (L.2) 

assuming  initial  ad  quality  is  q(0)=l. 

(B)  Consider  the  first  hiatus,  i.e.  h < t < li+ti.  Here 

I (u)  =0  since  ads  are  off.  Thus,  = -(c  + w x 0)q  + 8(1  - q) 

which  is  equivalent  to  — = 5 - (c  + 5)q  . Denote  (c+8)  = p. 

dt 

Then,  upon  using  the  method  of  integrating  factor,  the 
solution  to  this  ordinary  differential  equation  is  given  by 

q(t)  = q(lj)  )e'p(t'J,)  , for  t e Ui,  h+'h)  , (L.3) 

P P 
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where  q (1,)  = e~a(u)J'  (which  is  also  denoted  as  q:  in  the  main 
text  and  figure) . 

(C)  Consider  the  second  burst,  i.e.  Ii+Ti  < t < 
ii+Ti+iz.  Here  I(u)=l.  Thus,  = -a(u)q  + 0 ; its  solution  is 

given  by 


q(t)  = qUj  + X.)  e-a(n,(t-J'+T|)  , t e (li+ Tx.  li+Ti+Ia), 


(L . 4 ) 


where  q(Ii+Ti),  also  denoted  as  q2,  is  obtained  as  follows: 


8 .6 


■ <««. + *■>  ■ p - - q',e"'  p = c + 8- 


(D)  Consider  the  second  hiatus,  i.e.  h+Xi+l2  < t < T. 

Here  l(u)=0.  Thus,  — = 6 - (c  + 8)q  ; its  solution  was 

dt 

obtained  earlier.  Using  the  appropriate  initial  condition  to 
maintain  continuity,  we  get. 


8 ,5  , . . . . 

q(t)  = — — ( — q(-i)  + + i2)  )& 

p p 


(L . 5) 


for  the  period  t e (ii+Xi+I2,  T]  . The  term  q[h+Xi+l2)  is  also 
denoted  as  q3  which  is  obtained  as 


q3 


= q(I,  + x,  + 12)  - e 


-a(u)J2 


APPENDIX  M 

DERIVATION  OF  AWARENESS  DYNAMICS 
Awareness  evolution  is  given  by  the  following  equation: 

= q(t)g(u)  - 8A  . (M.l) 

dt 

(A)  Consider  the  first  burst,  i.e.  0 < t < h.  Here  ad 
quality  evolution  is  given  by  q(t)  = e a(u)  , during  te 
[0,li),  from  appendix  L.  Substituting  q(t)  in  (M.l),  the 
resulting  differential  equation  is  solved  by  using  the 
method  of  integrating  factors.  The  solution  is  given  by: 

A(t)  = 9(u)_-  {e~alg)t  - e”8t} , t e [0,1,)  (M.2) 

8 - a(u) 

(B)  Consider  the  first  hiatus,  i.e.  li  < t < 1 i+Ti . Here 

since  u(t)=0  during  media  hiatus,  which  implies  g(u)=0,  the 

dA 

awareness  evolution  gets  simplified  to  — = -8A  regardless 
of  q(t) . Hence,  the  solution  is  the  classic  decay  equation: 

A(t)  = A(l, )e”S(t_J,),  t e (!,,!,  + x,)  (M.3) 

The  awareness  level  A(li)  , also  denoted  as  Ax,  is  obtained 
from  (M.2)  as  follows. 
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A, 


= A(lj)  = 


g(u)  , -a(u)J, 

8 - a(u) 


e 


-81, 


} 


(C)  Consider  the  second  burst,  i.e.  li+ti  < t < 
ii+Xi+12-  Here  ad  quality  evolution  is  given  by  the 
expression  in  (L.4),  namely, 

q(t)  = qU,  + x,)  , t e (li+ti.  I1+T1+I2)  . 

Substituting  in  (M.l),  and  upon  integration, 

A(t)  = e“8t  J e5tq(t)g(u)dt  + (const)  e'8t 

= e~8tJ  e8tq2e~a(t_T|g(u)dt  + (const)  e“8t,  where  1 = i,  + t, 

= e'8tq2eaTg(u)J  e(8'a(“ntdt  + (const)  e"8t 

(8-a(u)  )t 

= e”8tq2ealg(u)  [ (fi  _ a(=fj]  + (const)  e 

= q2g(^  - e"aliI)  (t'T)  + (const)  e"8t 

(8  - a(u)  ) 

To  determine  the  constant  of  integration,  consider  the 
continuity  at  t=Ii+Tx.  Then,  continuing  from  the  above 
expression, 

A/7  . T \ - q2g(u) e-a(S)(oi  + (Const)  e'8u,+T,)  and,  from  (M3)  , 

AU>  + xi;  “ (5  - a(u)  ) 

A(i,  + x,)  = A(i1)e'8(J,+T,_Jl) 

Hence,  upon  equating  the  above  two  expressions,  the  constant 
of  integration  can  be  determined.  Thus,  the  awareness 
evolution  is  given  by. 
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A(t)  _ q2g(u)  {c~a(u) (t-D  - e-8(t_1)}  + A2e  8(t  11  (M.4) 

8 - a(u) 

where  1 = I,  + x,  and  A2  = A(l,)e  1 . 

(D)  Consider  the  second  hiatus,  i.e.  li+Xi +i2  < t < T. 
The  ad  quality  evolution  does  not  influence  awareness 
evolution  because  when  u(t)=0,  g(u)=0.  Thus  awareness  decays 

according  to  — = — 5A  , whose  solution,  with  the 
dt 

appropriate  continuity  conditions,  is  given  by  the 
following: 


A(t)  = A3e 


— S(t— J,  +1,  +Jj) 


where  A3  = A (1,  + x,  + 12) 

= q2g(u)  {e-a(uua 
5 - a(u) 


-6J,  , 

e 2} 


+ 


A2e 


-8J, 


(M.5) 


APPENDIX  N 

DERIVATION  OF  REVENUES  UNDER  TWO  PULSE  SCHEDULES 


In  appendix  M the  awareness  evolution  was  determined. 

To  obtain  the  revenue  generated  by  a two-pulse  schedule,  the 
area  under  awareness  curves  is  computed  by  integrating  the 
awareness  curves  over  t e[0,  oo)  , i.e.  revenue  function  is 

R = j A(t)dt  . (N.  1) 

0 


(A)  Consider  the  first  burst,  i.e.  0 < t < 1 i.  Here 
awareness  evolution  is  given  by  (M.2)  as  follows. 

Wt)  = g(g|_-  (e-*1''’1  - e"*),  t s [0,  1,) 

6 - a(u) 

The  area  under  the  above  A(t)  is  given  by 


R. 


j g(u)  re-a(u)t 

| 8 - a(u) 

g(u)  

5 - a(u)  Va(u) 


- e-5t}dt 


e-a(u  )-0 

a(u)  > 


(N . 2 ) 


(B)  Consider  the  first  hiatus,  i.e.  h < t < li+ti. 
Here,  from  (M.3),  the  awareness  evolution  is  given  by 
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A(t)  = A (l,)e_8tt  2,) , which  when  integrated  over  t e (lx,  lx  + 
we  get, 


r2  = j AU1)e-8(t-J‘,dt 


(N.3) 


= A(i, ) 


(1  - e-Sl' ) 


(C)  Consider  the  second  burst,  i.e.  ii+Xi  < t < 
h+ T1+I2.  Here,  the  awareness  evolution  is  given  by,  (M.4), 
as  follows: 

A(t)  _ [c-a(u)(t-T)  _ e-«(t-T)}  + A2e"5(t_1) 

5 - a(u) 

where  I = 1,  + T,  and  A2  = A (1^'  . The  revenue  generated 

during  this  period  is 


J,+T,+Jj 

r3  = J A(t)dt 

Ji+*>  (N.  4 ) 


r * - \ 

( -8J,  -a(u)Jj  ^ 

q2g(u) 

1 - 

+ 

e e 

5 - a(u) 

va(u)  8J 

L 5 a(u) 

(D)  Consider  the  second  hiatus,  i.e.  I1+T1+I2  < t < T. 
The  awareness  curve  is  given  by  (M.5)  whose  integral  over 

this  time  period  is 


R4 


J A(t)dt 

J,  + t,  + J2 


(1  - e’8*1 ) 


(N.5) 


5 
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(E)  The  salvage  value  of  the  terminal  awareness  level 


is  given  by 

00 

r5  = J A(t)dt 

T 

= f A(T)e'S(t'T,dt 

JT  (N.  6) 

A(T) 

5 

T,  -8lj 

A3e  2 
5 

(F)  Thus, 

the  total  revenue  is  R = Ri  + R2  + R3  + Ri  + R5 

which  is  a complicated,  but  closed-form,  function  of  the 
defining  parameters  of  two-pulse  schedule,  namely,  li,  I2 < 
/ and  t>2  • 
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